Single-chain insulin agonists exhibiting high activity at the insulin receptor

ABSTRACT

Single chain insulin analogs are provided having high potency and specificity for the insulin receptor. As disclosed herein optimally sized linking moieties can be used to link human insulin A and B chains, or analogs or derivatives thereof, wherein the carboxy terminus of the B25 amino acid of the B chain is linked to the amino terminus of the A1 amino acid of the A chain via the intervening linking moiety. In on embodiment the linking moiety comprises a polyethylene glycol of 6-16 monomer units and in an alternative embodiment the linking moiety comprises a non-native amino acid sequence derived form the IGF-1 C-peptide and comprising at least 8 amino acids and no more than 12 amino acid in length. Also disclosed are prodrug and conjugate derivatives of the single chain insulin analogs.

CROSS REFERENCE TO RELATED APPLICATIONS

This application claims priority to U.S. application Ser. No.13/701,146, filed Nov. 30, 2012, which is a 371 nationalization ofPCT/US2011/040699, filed Jun. 16, 2011, which claims priority to U.S.Provisional Patent Application Nos. 61/355,366 and 61/433,500, filedJun. 16, 2010 and Jan. 17, 2011. The entire disclosures ofPCT/US2011/040699, U.S. Ser. No. 61/355,366 and U.S. Ser. No. 61/433,500are hereby incorporated by reference in their entirety.

INCORPORATION BY REFERENCE OF MATERIAL SUBMITTED ELECTRONICALLY

Incorporated by reference in its entirety is a computer-readablenucleotide/amino acid sequence listing submitted concurrently herewithand identified as follows: One 90 KB ACII (Text) file named“SubSeqlist_singleST25” created on Mar. 29, 2016.

BACKGROUND

Insulin is a proven therapy for the treatment of juvenile-onset diabetesand later stage adult-onset diabetes. The peptide is biosynthesized as alarger linear precursor of low potency (approximately 2% to 9% of nativeinsulin), named proinsulin. Proinsulin is proteolytically converted toinsulin by the selective removal of a 35-residue connecting peptide (Cpeptide). The resultant heteroduplex formed by disulfide links betweenthe insulin “A chain” (SEQ ID NO: 1) and “B chain” (SEQ ID NO: 2) chain,representing a total of 51 amino acids, has high potency for the insulinreceptor (nM range). Native insulin has approximately one hundredfoldselective affinity for the insulin receptor relative to the relatedinsulin-like growth factor 1 receptor, but demonstrates littleselectively for the two different insulin receptor isoforms, named A &B.

The insulin-like growth factors 1 and 2 are single chain liner peptidehormones that are highly homologous in their A and B chain sequences,sharing approximately fifty percent homology with native insulin. TheIGF A and B chains are linked by a “C-peptide”, wherein the C-peptidesof the two IGFs differ in size and amino acid sequence, the first beingtwelve and the second being eight amino acids in length. Human IGF-1 isa 70 aa basic peptide having the protein sequence shown in SEQ ID NO: 3,and has a 43% homology with proinsulin (Rinderknecht et al. (1978) J.Biol. Chem. 253:2769-2776). Human IGF-2 is a 67 amino acid basic peptidehaving the protein sequence shown in SEQ ID NO: 4. The IGFs demonstrateconsiderably less activity at the insulin B receptor isoform than theA-receptor isoform.

Applicants have previously identified IGF-1 based insulin peptidesanalogs, (wherein the native Gln-Phe dipeptide of the B-chain isreplaced by Tyr-Leu) that display high activity at the insulin receptor(see PCT/US2009/068713, the disclosure of which is incorporated herein).Such analogs (referred to herein as IGF YL analog peptides) are morereadily synthesized than insulin and enable the development ofco-agonist analogs for insulin and IGF-1 receptors, and selectiveinsulin receptor specific analogs. Furthermore, these insulin analogscan also be formulated as single chain insulin agonists in accordancewith the present disclosure.

Single chain insulin analogs comprising the insulin A and B chains havebeen previously prepared (see EP 1,193,272 and US 2007/0129284).However, previously disclosed single chain insulin analogs suffer thedisadvantage of either exhibiting low potency at the insulin receptorand/or relatively high potency at the IGF-1 receptor. The compounds ofthe present invention are prepared based on the discovery that singlechain high potency insulin agonists can be prepared by insertion of theIGF-1 C-peptide, or analogs thereof, as a connecting peptide linking theinsulin B and A peptides. The selective mutation of individual aminoacids in the C-peptide sequence yields peptides that are highlyselective for insulin relative to IGF-1 receptor.

In addition, the preparation of single chain insulin agonists are likelyto enhance the secondary structure of insulin and insulin analogs,yielding improvements in biophysical stability, therapeutic index and invivo pharmacology. The pharmacology of insulin is not glucose sensitive,and as such, the administration of insulin can result in excessiveaction that can lead to life-threatening hypoglycemia. Inconsistentpharmacology is a hallmark of insulin therapy such that it is extremelydifficult to normalize blood glucose without occurrence of hypoglycemia.Furthermore, native insulin is of short duration of action and requiresmodification to render it suitable for use in control of basal glucose.Single chain insulin analog peptides are suitable for further structuralenhancements that are envisioned to yield improved therapeutic index,through the use of prodrug chemistry; extended duration of action, bylinkage of plasma proteins such as albumin, or other modifications,including pegylation and acylation; enhanced physical stability, byglycosylation; and preferred tissue targeting through the use ofchemical modification with cholesterol or vitamin-like substituents. Thepreparation of single chain insulin analogs using a C-peptide linkeralso provides a novel structural location for where many of thesechemical modifications can be successfully deployed. The primary use ofsuch optimized insulin-agonists would be in the treatment ofinsulin-dependent diabetes.

SUMMARY

As disclosed herein applicants have discovered high potency single chaininsulin analogs. More particularly, in one embodiment a high potencysingle chain insulin agonist polypeptide is provided that is highlyselective for the insulin receptor relative to the IGF-1 receptor. Inaccordance with one embodiment the single chain insulin analog agonistcomprises a B chain and A chain of human insulin, or analogs orderivatives thereof, wherein the carboxy terminus of the B chain islinked to the amino terminus of the A chain via a linking moiety. In oneembodiment the B chain is a C-terminal truncated B chain having aminoacids B26-B30 removed (positions relative to the native insulin Bchain). In this embodiment the carboxy terminus of the B25 amino acid ofthe B chain is directly linked to a first end of the linking moiety andthe second end of the linking moiety is directly linked to the aminoterminus of the A1 amino acid of the A chain. In one embodiment, whereinthe linear single chain insulin analog comprises a C-terminal truncatedB chain, and the linking moiety is a peptide, the linking moiety is atleast 8 amino acids in length but not greater than 17 amino acids inlength. In embodiments where the linear single chain insulin analogcomprises a full length B chain, and the linking moiety is a peptide,the linking moiety is at least 8 amino acids in length but not greaterthan 12 amino acids in length.

In accordance with one embodiment the linking moiety comprises

-   -   a) a polyethylene glycol of 6-16 monomer units;    -   b) a non-native insulin or IGF amino acid sequence of at least 8        amino acids and no more than 17 amino acid in length, or a        peptidomimetic thereof, and comprising the sequence        X₅₁X₅₂X₅₃X₅₄X₅₅X₅₆X₅₇X₅₈ (SEQ ID NO: 9); or    -   c) a combination of said polyethylene glycol and a shortened        amino acid sequence of 1 to 4 amino acids, wherein

X₅₁ is selected from the group consisting of glycine, alanine, valine,leucine, isoleucine, proline, phenylalanine and methionine;

X₅₂ is any amino acid other than tyrosine;

X₅₃ through X₅₆ are each independently any amino acid; and

X₅₇ and X₅₈ are independently selected from the group consisting ofarginine, ornithine and lysine. The linking moiety may comprisenon-naturally occurring amino acids as well as a retroinverso fragment,or incorporation of non-peptide bonds such as an azapeptide bond (COsubstituted by NH) or pseudo-peptide bond (e.g. NH substituted with CH₂)or an ester bond (e.g., a depsipeptide, wherein one or more of the amide(—CONHR—) bonds are replaced by ester (COOR) bonds). When the linkingmoiety comprises an amino acid sequence it is also intended that any ofthe designated amino acids also encompasses derivatives of the indicatedamino acid including chemical modifications to the amino acid, e.g. anitro group in a tyrosine residue, or iodine in a tyrosine residue, orby conversion of a free carboxylic group to an ester group or to anamide group, or by converting an amino group to an amide by acylation,or by acylating a hydroxy group rendering an ester, or by alkylation ofa primary amine rendering a secondary amine or linkage of a hydrophilicmoiety to an amino acid side chain. Other derivatives are obtained byoxidation or reduction of the side-chains of the amino acid.

In one embodiment the a single chain insulin analog comprises ahydrophilic moiety linked to the N-terminus of the B chain or to a sidechain of an amino acid of the linking moiety. More particularly, in oneembodiment the single chain insulin agonist analog comprises the generalstructure B-LM-A wherein B represents an insulin B chain, A representsan insulin A chain and LM represents a linking moiety linking thecarboxy terminus of the B chain to the amino terminus of the A chain,wherein the linking moiety further comprises a hydrophilic moiety linkedto the side chain of an amino acid of the linking moiety and/or to theN-terminal alpha amine of the B chain (position B1 for insulin based Bchains and position B2 for IGF-1 based B chains) or the side chain of anamino acid at a position selected from the group consisting of A9, A14and A15 of the A chain or positions B1, B2, B10, B22, B28 or B29 of theB chain. In one embodiment the hydrophilic moiety is linked to theN-terminal alpha amine of the B chain (i.e. positions B1 for insulin orposition B2 for IGF insulin agonist using the insulin-based numberscheme). In one embodiment the hydrophilic moiety is a polyethylenechain and in a further embodiment the polyethylene chain is covalentlybound to the side chain of an amino acid of the linking moiety. In oneembodiment the linking moiety (LM) comprises an amino acid sequence ofno more than 17 amino acids in length and comprising the sequenceX₅₁X₅₂X₅₃X₅₄X₅₅X₅₆X₅₇X₅₈ (SEQ ID NO: 9), wherein

X₅₁ is selected from the group consisting of glycine, alanine, valine,leucine, isoleucine, proline, phenylalanine and methionine;

X₅₂ is any amino acid other than tyrosine;

X₅₃ through X₅₆ are each independently any amino acid; and

X₅₇ and X₅₈ are independently arginine, lysine, cysteine, homocysteine,acetyl-phenylalanine or ornithine. In one embodiment the linking moiety(LM) comprises the sequence X₅₁X₅₂GSSSX₅₇X₅₈ (SEQ ID NO: 29) orX₅₁X₅₂GSSSX₅₇X₅₈APQT (SEQ ID NO: 46) wherein the amino acid designatedby X₅₇ or X₅₈ further comprises a hydrophilic moiety linked to the sidechain of the amino acid at that position. In one embodiment thehydrophilic moiety is a polyethylene glycol chain.

In one embodiment the linking moiety comprises an 8-17 amino acidsequence comprising the sequence GYGSSSX₅₇X₅₈ (SEQ ID NO: 85) orGYGSSSX₅₇X₅₈APQT; (SEQ ID NO: 37), or a peptidomimetic thereof;

wherein

X₅₇ and X₅₈ are independently arginine, lysine or ornithine. In oneembodiment the hydrophilic moiety is linked to the side chain of anamino acid located at position 8 of a linking moiety comprising SEQ IDNO: 37 or 85.

In one embodiment the linking moiety comprises

1) a linear polyethylene glycol chain of 6-16 monomer units,

2) an amino acid sequence at least 8 amino acids and no more than 12amino acid in length and comprising the sequence X₅₁X₅₂X₅₃X₅₄X₅₅X₅₆RR(SEQ ID NO: 10), or

3) a combination of said polyethylene glycol chain and an amino acidsequence. In a further embodiment the linking moiety is selected fromthe group consisting of (Y₁)_(k)—X₅₁AX₅₃X₅₄X₅₅X₅₆RR(Y₂)_(n) (SEQ ID NO:23), (Y₂)_(k)-GYGSSSX₅₇R (SEQ ID NO: 51) and

wherein

Y₁ is selected from the group X₄₆, X₄₆X₄₇, X₄₆X₄₇X₄₈, X₄₆X₄₇X₄₈X₄₉ (SEQID NO: 24) and X₄₆X₄₇X₄₈X₄₉X₅₀ (SEQ ID NO: 13); and

Y₂ is selected from the group X₇₀, X₇₀X₇₁, X₇₀X₇₁X₇₂ and X₇₀X₇₁X₇₂X₇₃(SEQ ID NO: 15);

n is 0 or 1;

k is 0 or 1;

m is an integer ranging from 7 to 16; and

X₄₆ through X₅₀ and X₇₀ through X₇₃ are each independently any aminoacid;

X₅₁ is selected from the group consisting of glycine, alanine, valine,leucine, isoleucine, proline, phenylalanine and methionine;

X₅₂ is any amino acid other than tyrosine;

X₅₃ through X₅₆ are each independently any amino acid; and

X₅₇ is arginine, lysine or ornithine.

In another embodiment the linking moiety is a relatively shortbifunctional non-peptide polymer linker that approximates the length ofan 8-16 amino acid sequence. In accordance with one embodiment thenon-peptide linking moiety is a polyethylene glycol linker ofapproximately 4 to 20, 8 to 18, 8 to 16, 8 to 14, 8 to 12, 10 to 14, 10to 12 or 11 to 13 monomers. In one embodiment a single chain insulinagonist is provided wherein the last five carboxy amino acids of the Bchain are deleted, and amino acid B25 is linked to amino acid A1 of theA chain via a polyethylene glycol linking moiety (LM) of at least 6 butno more than 20 monomer units, or at least 8 but no more than 14 monomerunits, or at least 10 but no more than 14 monomer units. In oneembodiment the linking moiety is a polyethylene glycol having a total of12 monomers.

In one embodiment the linking moiety of the single chain insulin analogis a bifunctional complex of the formula X—Y, wherein X is a non-peptidelinker (e.g., polyethylene glycol) and Y is an amino acid or a 2-4 aminoacid peptide. In one embodiment the last five amino acids of the nativeB chain carboxy terminus are deleted, and the carboxy terminus of aminoacid B25 is linked directly to X, and Y is directly linked to the aminoterminus of an insulin A chain. In another embodiment a single chaininsulin agonist is provided wherein the last five carboxy amino acids ofthe native B chain carboxy terminus are deleted, and amino acid B25 islinked to amino acid A1 of the A chain via a linking moiety comprising apolyethylene glycol of at least 8 but less than 14 monomer units inlength and a 2-5 amino acid sequence. The 2-5 amino acid sequence can belocated between the B chain and the polyethylene glycol chain or betweenthe A chain and the polyethylene glycol chain. However, when the 2-5amino acid sequence is located between the B chain and the polyethyleneglycol chain, the amino acid sequence is not YTPKT (SEQ ID NO: 16) orFNKPT (SEQ ID NO: 76). In one embodiment the linking moiety comprisestwo polyethylene chains separated by 1 or 2 amino acids. In oneembodiment the 1 or 2 amino acids are independently lysine or cysteine.In one embodiment the linking moiety comprises a two polyethylene chainsrepresenting a total of 8-12 or 10-14 or 12 monomeric units of ethyleneglycol separated by a single amino acid. In one embodiment the singleamino acid is lysine or cysteine.

The single chain insulin agonists of the present invention may comprisethe native B and A chain sequences or any of the known analogs orderivatives thereof that exhibit insulin agonist activity when linked toone another in a heteroduplex. As disclosed herein such A chain and Bchain peptides can be linked to one another by the linking moietiesdescribed herein to form a single chain insulin agonist. In accordancewith one embodiment the B chain comprises the sequence

R₂₂—X₂₅LCGX₂₉X₃₀LVX₃₃X₃₄LX₃₆LVCGX₄₁X₄₂GFX₄₅ (SEQ ID NO: 20), and the Achain comprises the sequence

GIVX₄X₅CCX₈X₉X₁₀CX₁₂LX₁₄X₁₅LX₁₇X₁₈X₁₉CX₂₁—R₁₃ (SEQ ID NO: 22), wherein

X₄ is glutamic acid or aspartic acid;

X₅ is glutamine or glutamic acid

X₈ is histidine or phenylalanine;

X₉ is serine, arginine, lysine, ornithine or alanine;

X₁₀ is isoleucine or serine;

X₁₂ is serine or aspartic acid;

X₁₄ is tyrosine, arginine, lysine, ornithine or alanine;

X₁₅ is glutamine, glutamic acid, arginine, alanine, lysine, ornithine orleucine;

X₁₇ is glutamic acid, aspartic acid, asparagine, lysine, ornithine orglutamine;

X₁₈ is methionine, asparagine, glutamine, aspartic acid, glutamic acidor threonine;

X₁₉ is tyrosine, 4-methoxy-phenylalanine or 4-amino phenylalanine;

X₂₁ is selected from the group consisting of alanine, glycine, serine,valine, threonine, isoleucine, leucine, glutamine, glutamic acid,asparagine, aspartic acid, histidine, tryptophan, tyrosine, andmethionine;

X₂₅ is histidine or threonine;

X₂₉ is selected from the group consisting of alanine, glycine andserine;

X₃₀ is selected from the group consisting of histidine, aspartic acid,glutamic acid, homocysteic acid and cysteic acid;

X₃₃ is selected from the group consisting of aspartic acid and glutamicacid;

X₃₄ is selected from the group consisting of alanine and threonine;

X₃₆ is tyrosine;

X₄₁ is selected from the group consisting of glutamic acid, asparticacid or asparagine;

X₄₂ is selected from the group consisting of alanine, ornithine, lysineand arginine;

X₄₅ is tyrosine;

R₂₂ is selected from the group consisting of AYRPSE (SEQ ID NO: 14),FVNQ (SEQ ID NO: 12), PGPE (SEQ ID NO: 11), a tripeptideglycine-proline-glutamic acid, a tripeptide valine-asparagine-glutamine,a dipeptide proline-glutamic acid, a dipeptide asparagine-glutamine,glutamine, glutamic acid and an N-terminal amine; and

R₁₃ is COOH or CONH₂. In one embodiment X₈, X₂₅ and X₃₀ are eachhistidine.

In one embodiment, prodrug derivatives of single chain insulin analogsare provided wherein the prodrug comprises a dipeptide prodrug element(U-B) covalently linked to an active site of the single chain insulinanalog via an amide or ester linkage (see International applications WO2009/099763 and PCT/US2009/068713 the disclosures of which areincorporated herein). Subsequent removal of the dipeptide underphysiological conditions and in the absence of enzymatic activityrestores full activity to the single chain insulin analog. In oneembodiment the prodrug element comprises a dipeptide of the structureU-B, wherein U is an amino acid or a hydroxy acid, B is an N-alkylatedamino acid linked to said single chain insulin agonist through an amidebond between a carboxyl moiety of B and an amine of the single chaininsulin agonist, wherein U, B, or the amino acid of the single chaininsulin agonist to which U-B is linked is a non-coded amino acid. In oneembodiment the chemical cleavage half-life (t_(1/2)) of U-B from thesingle chain insulin agonist is at least about 1 hour to about 1 week inPBS under physiological conditions. In one embodiment the single chainagonist comprises a 4-amino phenylalanine at position A19, and dipeptideprodrug element U-B is linked to a single chain insulin agonist throughan amide bond between a carboxyl moiety of B and the para amine of4-amino phenylalanine.

Additional derivatives of the single chain insulin agonists areencompassed by the present disclosure including modifications thatimprove the solubility of the underlying single chain insulin agonist.In one embodiment the solubility of the single chain insulin agonistpeptide is enhanced by the covalent linkage of a hydrophilic moiety tothe peptide. In one embodiment the hydrophilic moiety is linked toeither the N-terminal amino acid of the B chain or to the side chain ofan amino acid located at the terminal end of the B chain (e.g. a lysinepresent at any of positions B26-30) or to the linking moiety binding theB chain to the A chain. In one embodiment the hydrophilic moiety isalbumin, including for example, albumins such as human serum albumin(HSA) and recombinant human albumin (rHA). In one embodiment thehydrophilic moiety is a polyethylene glycol (PEG) chain, having amolecular weight selected from the range of about 500 to about 40,000Daltons. In one embodiment the polyethylene glycol chain has a molecularweight selected from the range of about 500 to about 5,000 Daltons. Inanother embodiment the polyethylene glycol chain has a molecular weightof about 10,000 to about 20,000 Daltons.

Acylation or alkylation can increase the half-life of the single chaininsulin analog peptides, and prodrug derivatives thereof, incirculation. Acylation or alkylation can advantageously delay the onsetof action and/or extend the duration of action at the insulin receptors.The insulin analogs may be acylated or alkylated at the same amino acidposition where a hydrophilic moiety is linked (including, for example atposition 8 of the linking moiety), or at a different amino acidposition.

Also encompassed by the present disclosure are pharmaceuticalcompositions comprising the single chain insulin analog agonist, andderivatives thereof, and a pharmaceutically acceptable carrier. Inaccordance with one embodiment a pharmaceutical composition is providedcomprising any of the single chain insulin analogs disclosed herein, orderivative thereof, preferably at a purity level of at least 90%, 91%,92%, 93%, 94%, 95%, 96%, 97%, 98% or 99%, and a pharmaceuticallyacceptable diluent, carrier or excipient. Such compositions may containa single chain insulin agonist peptide as disclosed herein at aconcentration of at least 0.5 mg/ml, 1 mg/ml, 2 mg/ml, 3 mg/ml, 4 mg/ml,5 mg/ml, 6 mg/ml, 7 mg/ml, 8 mg/ml, 9 mg/ml, 10 mg/ml, 11 mg/ml, 12mg/ml, 13 mg/ml, 14 mg/ml, 15 mg/ml, 16 mg/ml, 17 mg/ml, 18 mg/ml, 19mg/ml, 20 mg/ml, 21 mg/ml, 22 mg/ml, 23 mg/ml, 24 mg/ml, 25 mg/ml orhigher. In one embodiment the pharmaceutical compositions compriseaqueous solutions that are sterilized and optionally stored withinvarious package containers. In other embodiments the pharmaceuticalcompositions comprise a lyophilized powder. The pharmaceuticalcompositions can be further packaged as part of a kit that includes adisposable device for administering the composition to a patient. Thecontainers or kits may be labeled for storage at ambient roomtemperature or at refrigerated temperature.

In accordance with one embodiment an improved method of regulating bloodglucose levels in insulin dependent patients is provided. The methodcomprises the steps of administering to a patient a single chain insulinagonist peptide, or derivative thereof, in an amount therapeuticallyeffective for the control of diabetes.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1. is a schematic overview of the two step synthetic strategy forpreparing human insulin. Details of the procedure are provided inExample 1.

FIG. 2 is a graph comparing insulin receptor specific binding ofsynthetic human insulin relative to purified native insulin. Thesynthetic insulin was produced by the approach detailed in FIG. 1 wherethe A⁷-B⁷ bond is the first disulfide formed. As indicated by the datapresented in the graph, the two molecules have similar bindingactivities.

FIG. 3 is a graph comparing relative insulin receptor binding of nativeinsulin and the A19 insulin analog (Insulin(p-NH₂—F)¹⁹). As indicated bythe data presented in the graph, the two molecules have similar bindingactivities.

FIG. 4 is a graph comparing relative insulin receptor binding of nativeinsulin and the IGF1(Y^(B16)L^(B17)) analog. As indicated by the datapresented in the graph, the two molecules have similar bindingactivities.

FIG. 5 is an alignment of the human proinsulin (A chain, SEQ ID NO: 1; Bchain, SEQ ID NO: 2 and the C chain, SEQ ID NO: 92) and insulin-likegrowth factors I (A chain, SEQ ID NO: 5; B chain, SEQ ID NO: 6 and the Cchain, SEQ ID NO: 17) and II (A chain, SEQ ID NO: 7; B chain, SEQ ID NO:8 and the C chain, SEQ ID NO: 198) amino acid sequences. The sequencesof the D chain for IGF I (SEQ ID NO: 199) and IGF II (SEQ ID NO: 200)are also provided. The alignment demonstrates that these three peptidesshare a high level of sequence identity (* indicates a space with nocorresponding amino acid and a dash (-) indicates the identical aminoacid as present in insulin).

FIG. 6 is a schematic drawing of the synthetic scheme used to preparethe IGF1(Y^(B16)L^(B17))(p-NH₂—F)^(A19) prodrug derivatives. Thespecific derivative is p-NH2-F where the aromatic amine is acylated withthe dipeptide Aib-Ala, which serves as a negative control since thisdipeptide does not cleave under physiological conditions.

FIG. 7 is a graph comparing relative insulin receptor binding ofIGF1(Y^(B16)L^(B17))(p-NH₂—F)^(A19) and the dipeptide extended form ofIGF1(Y^(B16)L^(B17))(p-NH₂—F)^(A19)-AibAla. The synthesis of thisprodrug is shown in FIG. 6 where the dipeptide AibAla is bound atposition A19 (i.e. IGF1(Y^(B16)L^(B17))(AibAla). The dipeptide does notreadily cleave under physiological conditions and thus the activity isextremely low which demonstrates the ability of acylation at this sitewith dipeptide to silence bioactivity. This constitutes one of the twocentral ingredients of a prodrug, low activity in the prodrug form.

FIG. 8A-8C provides the activity of a dimer prepared in accordance withthe present disclosure. FIG. 8A shows the structure of an IGF-1 singlechain dimer that comprises two single chain IGF^(B16B17) analog peptides(IGF-1B chain[C⁰H⁵Y¹⁶L¹⁷O²²]-A chain[O^(9,14,15)N^(18,21)]; SEQ ID NO:93) linked together by a disulfide bond between the side chains of theamino terminus of the B chains. The native insulin disulfides (A⁶-A¹¹,A⁷-B⁷, A²⁰-B¹⁹) are not shown but are resident in the dimer form. Thesingle chain form of the disulfide dimer can be converted to a two-chainform by selective proteolytic digestion of the two Arg-Gly bonds asdenoted by the arrows. FIG. 8B is a graph demonstrating the relativeinsulin receptor binding of insulin, a single chain IGF^(B16B17) analogpeptide dimer and a two chain IGF^(B16B17) analog peptide dimer. FIG. 8Cis a graph demonstrating the relative activity of insulin, and a twochain IGF^(B16B17) analog peptide dimer to induce insulin receptorphosphorylation.

FIG. 9A-9C shows the degradation of a prodrug form of an IGF A chainpeptide: (Aib-Pro on (pNH₂—F)¹⁹ of IGF1A(Ala)^(6,7,11,20)amide. Thedipeptide was incubated in PBS, pH 7.4 at 37° C. for predeterminedlengths of time. Aliquots were taken at 20 minutes (FIG. 9A), 81 minutes(FIG. 9B) and 120 minutes (FIG. 9C) after beginning the incubation, werequenched with 0.1% TFA and tested by analytical HPLC. Peak a(IGF1A(Ala)^(6,7,11,20)(pNH₂—F)¹amide) and b(IGF1A(Ala)^(6,7,11,20)(Aib-Pro-pNH—F)¹⁹amide) were identified withLC-MS and quantified by integration of peak area. The data indicate thespontaneous, non-enzymatic conversion ofIGF1A(Ala)^(6,7,11,20)(Aib-Pro-pNH—F)¹⁹amide toIGF1A(Ala)^(6,7,11,20)(pNH₂—F)¹amide over time.

FIGS. 10A & 10B are graphs depicting the in vitro activity of theprodrug Aib,dPro-IGF1YL (dipeptide linked through the A19 4-aminoPhe).FIG. 10A is a graph comparing relative insulin receptor binding ofnative insulin (measured at 1 hour at 4° C.) and the A19 IGF prodrugderivative (Aib,dPro-IGF1YL) over time (0 hours, 2.5 hours and 10.6hours) incubated in PBS. FIG. 10B is a graph comparing relative insulinreceptor binding of native insulin and the A19 IGF prodrug derivative(Aib,dPro-IGF1YL) over time (0 hours, 1.5 hours and 24.8 hours)incubated in 20% plasma/PBS at 37° C. As indicated by the data presentedin the graph, increased activity is recovered from the A19 IGF prodrugderivative sample as the prodrug form is converted to the active IGF1YLpeptide.

FIGS. 11A & 11B are graphs depicting the in vitro activity of theprodrug dK,(N-isobutylG)-IGF1YL (dipeptide linked through the A194-aminoPhe). FIG. 11A is a graph comparing relative insulin receptorbinding of native insulin and the A19 IGF prodrug derivative (IGF1YL:dK,(N-isobutylG) over time (0 hours, 5 hours and 52 hours) incubated inPBS. FIG. 11B is a graph comparing relative insulin receptor binding ofnative insulin and the A19 IGF prodrug derivative (IGF1YL:dK,(N-isobutylG) over time (0 hours, 3.6 hours and 24.8 hours) incubatedin 20% plasma/PBS at 37° C. As indicated by the data presented in thegraph, increased activity is recovered from the A19 IGF prodrugderivative sample as the prodrug form is converted to the active IGF1YLpeptide.

FIGS. 12A & 12B are graphs depicting the in vitro activity of theprodrug dK(e-acetyl),Sar)-IGF1YL (dipeptide linked through the A194-aminoPhe). FIG. 12A is a graph comparing relative insulin receptorbinding of native insulin (measured at 1 hour at 4° C.) and the A19 IGFprodrug derivative (IGF1YL: dK(e-acetyl),Sar) over time (0 hours, 7.2hours and 91.6 hours) incubated in PBS. FIG. 12B is a graph comparingrelative insulin receptor binding of native insulin and the A19 IGFprodrug derivative (IGF1YL: dK(e-acetyl),Sar) over time (0 hours, 9hours and 95 hours) incubated in 20% plasma/PBS at 37° C. As indicatedby the data presented in the graph, increased activity is recovered fromthe A19 IGF prodrug derivative sample as the prodrug form is convertedto the active IGF1YL peptide.

FIG. 13 is a graph comparing relative insulin receptor binding of nativeinsulin heteroduplex and the IGF-1 A and B chain heteroduplex and asingle chain IGF-1 analog wherein the carboxy terminus of the B chain isdirectly linked to the N-terminus of the IGF-1 A chain.

FIG. 14 is a graph comparing relative insulin receptor binding of nativeinsulin heteroduplex, IGF-1, the IGF-1 delta heteroduplex and a singlechain IGF-1 delta single chain analog wherein the carboxy terminus ofthe B chain is linked to the N-terminus of the IGF-1 A chain through apeptide linker consisting of the sequence GYGSSSOR (SEQ ID NO: 65),wherein the IGF-1 delta analog comprises the native IGF-1 sequence withthe following amino acid substitutions: HA8, OA9, OA14, OA15, QA17,NA21, YB16, LB17, OB22.

FIG. 15 is a bar graph depicting the relative in vitro binding activityof single chain insulin analogs at the IGF-1 receptor or the A or Bsubtype insulin receptors wherein the carboxy terminus of the nativeinsulin B chain is linked to the amino terminus of the native insulin Achain via the IGF-1 C peptide or various derivative of the IGF-1 Cpeptide. In the B⁰C¹A⁰ insulin analog nomenclature, the B⁰ and A⁰designations refer to the insulin sequences of the A and B chain, whileC¹ designates the IGF-1 C peptide. As shown by the data a single chaininsulin analog that links the B chain to the A chain via the IGF-1 Cpeptide is a potent insulin agonist. Furthermore, modifications ofposition 2 (e.g., substituting alanine for native tyrosine), oralternatively deleting the last four amino acids of the IGF-1 C linkingpeptide, generates a high potency, insulin selective single chaininsulin analog.

FIG. 16 is a bar graph depicting the relative in vitro binding activityof single chain insulin analogs of the formula B⁰C¹A⁰ at the IGF-1receptor or the A or B subtype insulin receptors wherein the nativesequence of the linking IGF-1 C peptide has been modified by theindicated amino acid substitutions at position 1, 2, 3, 4 or 8. In theB⁰C¹A⁰ insulin analog nomenclature, the B⁰ and A⁰ designations refer tothe insulin sequences of the A and B chain, while C¹ designates theIGF-1 C peptide.

FIG. 17 is a bar graph depicting the relative in vitro binding activityand phosphorylation activity of single chain B⁰C¹A⁰ insulin analogs atthe A subtype insulin. The activity of the native IGF-1 C peptide (010)relative to various amino acid substitutions or deletions was compared.In the B⁰C¹A⁰ insulin analog nomenclature, the B⁰ and A⁰ designationsrefer to the insulin sequences of the A and B chain, while C¹ designatesthe IGF-1 C peptide.

FIG. 18 is a bar graph depicting the relative in vitro binding activityand phosphorylation activity of single chain B⁰C¹A⁰ insulin analogs atthe A subtype insulin wherein the native sequence of the linking IGF-1 Cpeptide has been modified by the indicated amino acid substitutions atposition 1, 2, 3, 4 or 8. This data in conjunction with the dataprovided in FIG. 17 demonstrate the consistency between the binding andphosphorylation activity of the insulin analogs.

FIG. 19 is a schematic drawing showing the preparation of an IGF-1 YLsingle chain insulin analog that uses a PEG polymer as the linkingmoiety. Seg 1 (SEQ ID NO: 201) is joined to Seg 2 (SEQ ID NO: 202) toform the IGF-1 YL single chain insulin analog (SEQ ID NO: 203).

FIGS. 20A &B are graphs depicting the relative in vitro binding activity(FIG. 20 A) and phosphorylation activity (FIG. 20 B) of single chaininsulin analogs linked via a 4, 8 or 16 monomeric PEG linking moietyrelative to the native insulin heteroduplex.

FIGS. 21A & B are graphs demonstrating the phosphorylation activity ofsingle chain insulin analogs at the insulin and IGF-1 receptors. Singlechain insulin analogs comprise the full length IGF B and A chains linkedtogether by either a 4, 8 or 16 monomeric PEG linking moiety (SEQ ID NO203) have relatively low insulin potency as compared to native insulinheteroduplex (FIG. 21A), although they exhibit 10 to 100 fold greateractivity relative to a single chain insulin analog without a PEG orother linking moiety (see Table 16). However, when the last 5 carboxyamino acids of the IGF B chain (B²⁶⁻³⁰) are deleted (DesV) and thecarboxy terminus of the remaining B chain is linked to the IGF A chainby either an 8, 12 or 16 monomeric PEG linking moiety, the single chainanalogs exhibit at least equivalent or higher potency relative to thenative insulin heteroduplex.

FIG. 22A-C are graphs demonstrating the near equivalency of single chaininsulin analogs at the insulin receptor. FIG. 22A provides datacomparing native insulin (a heteroduplex of an A chain of SEQ ID NO: 1and a B chain of SEQ ID NO: 2) to single chain insulin analogscomprising either the native insulin sequences PE InsPeg₁₂DesV (SEQ IDNO: 204) and InsPeg₁₂DesV (SEQ ID NO: 205) or the IGF-1 A and IGF-1 YL Bchains (IGF-1Peg₁₂DesV; SEQ ID NO: 206) wherein the last 5 carboxy aminoacids of the B chain are the deleted (DesV) and the B chain is linked toits corresponding insulin or IGF-1 A chain by a 12 monomeric PEG linkingmoiety. FIGS. 22B & C are graphs showing the results of comparativeinsulin tolerance tests conducted on mice comparing the ability ofMIU-38 [a single chain insulin analog wherein the B and A chains arelinked via PEG: [B¹(H5,H10,Y16,L17)25(Peg12)A¹(H8,N18,N21)], see FIG.22B and MIU-35 [a single chain insulin analog wherein the B and A chainsare linked via a peptide linker: B¹(H5,H10,Y16,L17)25-C¹-A¹(H8,N18,N21),see FIG. 22C, relative to a vehicle control, to reduce and sustain lowblood glucose concentration. Two experiments were conducted whereinMIU-38 and MIU-35 were administered at 27 and 90 nmol/kg.

FIG. 23 is a table demonstrating various histidine substitutions tosingle chain insulin/IGF-1 based analogs (SEQ ID NO: 207). Thesubstitution of histidine at position 8 of the IGF-1 A chain can enhancethe potency of IGF based single chain insulin analog agonists.

FIG. 24 is a comparative analysis of the single chain peg-linked analogsactivities at the insulin and IGF-1 receptors as measured by receptorsignaling through phosphorylation.

FIG. 25 is a graph demonstrating that single chain analogs comprising anIGF-1 A chain have enhanced resistance to degradation by the specificinsulin degrading enzyme (IDE) relative to single chain analogscomprising an insulin A chain.

FIG. 26 is a graph demonstrating the relative activity of IGF-1, insulinand insulin/IGF chimera to induce in vitro cellular proliferation. Theresults indicate that the insulin activity associated with the IGF-1single chain insulin analogs does not correlate with the proliferationactivity associated with native IGF-1.

FIG. 27 is a comparative analysis of single chain peg-linked analogsactivities at the insulin and IGF-1 receptors as measured by receptorsignaling through phosphorylation. Single chain analogs having thesequence of SEQ ID NO: 203 and using PEG chain linkers of differentlengths were analyzed to measure how different sized PEG linkingmoieties impact in vitro activities at the insulin and IGF-1 receptors.The data presented in FIG. 27 reveals that a PEG₁₂DesV construct(wherein the 5 carboxy terminal amino acid of the B chain have beendeleted) provides the most potent compound.

FIGS. 28A and 28B is a comparative analysis of single chain peg/aminoacid-linked analogs in vitro activities at the insulin and IGF-1receptors as measured by receptor binding and receptor signaling throughphosphorylation. FIG. 28A shows the in vitro activity of a single chainanalog comprising a PEG₁₂ chain with an inserted single amino acid(glycine or lysine) as the linking moiety linking a DesV B chain to thenative insulin A chain. FIG. 28B shows in vitro activity of a singlechain analog comprising a linking moiety comprising a PEG₁₂ chain withtwo lysine residues inserted (single chain peg/(lysine)₂-linked analog).

FIG. 29A-E provides in vivo data of mice administered various singlechain insulin analogs. FIG. 29A provides in vitro comparative analysisof single chain peg-linked analogs activities at the insulin receptor asmeasured by receptor binding and receptor signaling throughphosphorylation; FIGS. 29B and 29C provide data on blood glucoseconcentrations over 8 hours after administration of the listed analogs.FIGS. 29D and 29E provide data on blood glucose AUC values afteradministration of the listed analogs at two different concentrations (27nmol/kg and 90nmol/kg).

FIGS. 30A-30D are graphs showing the results of comparative insulintolerance tests conducted on mice comparing the ability of human insulinto reduce and sustain low blood glucose concentration relative to threedifferent acylated insulin analogs. The compounds were tested at twodifferent concentrations (27 nmol/kg and 90nmol/kg). The acylatedinsulins included MIU-41, MIU-36 and MIU-37. MIU-41[B¹(H5,H10,Y16,L17)25a: A¹(H8,rEC16-K14,N18,N21)], is a two chaininsulin analog having a C16 acylation via a gamma glutamic acid linkerattached to a lysine residue located at position A14. MIU-36[B¹(C16-K0,H5,H10,Y16,L17)25a: A¹(N18,N21)], is a two chain insulinanalog having a C16 acylation linked to the N-terminus of the B chain).MIU-37 [B¹(H5,H10,Y16,L17,C16rE-K22)25a: A¹(N18,N21)], is a two chaininsulin analog having a C16 acylation via a gamma glutamic acid linkerattached to a lysine residue located at position B22.

FIGS. 31A-31D show the results of comparative insulin tolerance testsconducted on mice comparing the activity of the commercially availableacylated insulin analog (Detemir) relative to the acylated two chaininsulin analog MIU-55. MIU-55 [B¹(H5,10,Y16,L17,C16rE-K22)25a:A¹(N18,N21)] has the C-terminal 5 amino acids of the B chain deleted andterminates as a B chain amide. It is acylated with a C16 fatty acidthrough a gamma Glu linker at the ε-amino group of Lys B22. The resultsindicate that MIU-55 is about one third as potent as Detemir (see FIGS.31A and 31B). The data also indicate that the acylated forms of insulinare longer acting than the non-acylated forms and that MIU-55 while lesspotent than Detemir, exhibits a similar profile as Detemir. FIGS. 31Cand 31D provide data on blood glucose AUC values after administration ofthe listed analogs.

FIGS. 32A-32D show the results of comparative insulin tolerance testsconducted on mice comparing the activity of the commercially availableacylated insulin analog (Detemir) relative to the acylated two chaininsulin analog MIU-49. MIU-49 [B¹(C16-rE,H5,Aib9,H10,E13-K17,Y16)25a:A¹(N18,N21)] is a two chain insulin agonist having the C-terminal 5amino acids of the B chain deleted and acylated with a C16 fatty acidthrough a gamma Glu linker at the α-amino group of Gly B2). The resultsindicate that MIU-49 is about one third as potent as Detemir (see FIGS.32A and 32B. The data also indicate that the acylated forms of insulinare longer acting than the non-acylated forms and that MIU-49 while lesspotent than Detemir, exhibits a similar profile as Detemir. FIGS. 32Cand 32D provide data on blood glucose AUC values after administration ofthe listed analogs.

FIGS. 33A-33D represents the results obtained from a comparative insulintolerance test for Detemir and MIU-56 using C57/Blk mice. MIU-56 is aninsulin single chain analog B¹(H5,Y16,L17)25α-PEG8-K-PEG4-A¹(N18,21)comprising a 20 kDa PEG linked to the side chain of the single lysineresidue in the linking moiety (PEG8-K-PEG4) that joins the A chain andthe B chain. FIGS. 33A and 33B are graphs showing the results of insulintolerance tests comparing the ability of the acylated insulin analogDetemir relative to the pegylated single chain insulin analog MIU-56 toreduce and maintain low blood glucose levels. FIGS. 33C and 33D show theblood glucose AUC_(24 hrs) in mice administered Detemir and MIU-56,respectively.

FIGS. 34A-34F represents the results obtained from a comparative insulintolerance test for MIU-56 and MIU-57 using C57/Blk mice. MIU-57 is aninsulin single chain analog (B¹(H5,Y16,L17)25-C¹-A¹(N18,21) comprising a20 kDa PEG linked to the N-terminus of the B chain. FIGS. 34A and 34Bare graphs showing the results of insulin tolerance tests comparingMIU-56 and MIU-57. FIGS. 34C and 34D show the blood glucose AUC_(24 hrs)in mice administered MIU-56 and MIU-57, respectively. Results fromcomparative insulin dose titrations of MIU-56 and MIU-57 reveal that asimilar profile is obtained in mice for dosages ranging from 20 nmol/kgthrough 80 nmol/kg (see FIGS. 34E and 34F). A dimer (MIU 58) wasprepared comprising two insulin single chain analogs(B1(H5,Y16,L17)25-C1-A¹(N18,21) linked head to head via a 20 kDa PEGchain. FIGS. 34G-34J represents the results obtained from a comparativeinsulin tolerance test for MIU-57 and MIU-58 using C57/Blk mice. FIGS.34G and 34H are graphs showing the results of insulin tolerance testscomparing MIU-57 (monomer) and MIU-58 (dimer). FIGS. 34I and 34J showthe blood glucose AUC_(24 hrs) in mice administered MIU-57 and MIU-58,respectively.

FIGS. 35A-35B provide data from a comparative insulin dose titration oftwo pegylated insulin derivatives. The insulin derivatives differ basedon the placement of a 20 kDa PEG which is linked to the N-terminus (FIG.35A) of MIU-59, or to the side chain of amino acid B29, of an insulinanalog MIU-60, wherein the A1 and B1 amino acids have been carbamylated(FIG. 35B).

FIGS. 36A-36D provide data from a comparative insulin dose titration ofthe three single chain insulin analogs MIU-67, MIU-68 and MIU-69, eachcomprising two PEG chains of 10 kDa each relative to the singlepegylated (20K PEG) native insulin derivative (MIU-59). Moreparticularly, the activities of single chain insulin analogs MIU-67(B¹(H5,Y16,L17)25-C¹(K8)-A¹(N18,21)) having two PEG chains (10K each)one linked at the N-terminus and the other at amino acid 8 of thelinking moiety (position C8), MIU-68 (B¹(H5,Y16,L17,K22)25-C¹(K8)-A¹(N18,21)) having two PEG chains (10K each) one linked atthe N-terminus and the other at amino acid B22 and MIU-69(B¹(H5,Y16,L17)25-C¹(K8)-A¹(K14, N18,21)) having two PEG chains (10Keach) one linked at the N-terminus and the other at amino acid A14 werecompared. Each compound was administered at two dosages (20 and 80nmol/kg).

FIGS. 37A & 37B Diabetic mice (db/db mice) were administered pegylatedinsulin analogs to compare their relative activity in relation tocommercially available insulin analogs. The x-axis indicates theconcentration of the administered compound (i.e., vehicle control, 30 or90 nmol/kg or 60 nmol/kg for Humulin and 30, 90 and 240 for Levemir). Inparticular, insulin analogs Levemir and Humulin were compared to thepegylated insulin analogs MIU-59 (native insulin analog having a single20 kDa PEG linked to its N-terminus) and MIU-66 (native insulin analoghaving a single 20 kDa PEG linked to its N-terminus and the aminoterminus of the A and B chain carbamylated. Both MIU-59 and MIU-66 haveimproved activity relative to Levemir and Humulin (see FIGS. 37A at 12hrs and 37B at 24 hours).

FIG. 38 is a graph showing the results of a comparative insulintolerance test conducted in normal mice for a prodrug two chain insulinanalog acylated at the dipeptide prodrug element (MIU-29:[B¹(Y16,L17,Y25)29a:

A¹(aF19-dLys(Ac),NLeu)] relative to it parent insulin analog (MIU-27:[B¹(Y16,L17,Y25)29a: A¹(aF19-)]. The prodrug derivative MIU-29 comprisesa 4-amino-phenylalanine substitution at position A19 wherein a dipeptidedLys(Ac),NLeu has been covalently linked at the 4-amino position of theA19 residue and the side chain of the lysine of the dipeptide elementhas been acylated with a C14 fatty acid. This dipeptide will autocleaveunder physiological conditions with a half life of approximately 5hours. After incubating MIU-29 for 24 hours ex vivo, the resultantcompound (designated “MIU-29c”) was administered to mice and its abilityto lower blood glucose was compared to parent compound. As shown in FIG.38 the two compounds performed almost identically.

DETAILED DESCRIPTION Definitions

In describing and claiming the invention, the following terminology willbe used in accordance with the definitions set forth below.

The term “about” as used herein means greater or lesser than the valueor range of values stated by 10 percent, but is not intended todesignate any value or range of values to only this broader definition.Each value or range of values preceded by the term “about” is alsointended to encompass the embodiment of the stated absolute value orrange of values.

As used herein, the term “prodrug” is defined as any compound thatundergoes chemical modification before exhibiting its pharmacologicaleffects.

As used herein the term “amino acid” encompasses any molecule containingboth amino and carboxyl functional groups, wherein the amino andcarboxylate groups are attached to the same carbon (the alpha carbon).The alpha carbon optionally may have one or two further organicsubstituents. For the purposes of the present disclosure designation ofan amino acid without specifying its stereochemistry is intended toencompass either the L or D form of the amino acid, or a racemicmixture. However, in the instance where an amino acid is designated byits three letter code and includes a superscript number, the D form ofthe amino acid is specified by inclusion of a lower case d before thethree letter code and superscript number (e.g., dLys⁻¹), wherein thedesignation lacking the lower case d (e.g., Lys⁻¹) is intended tospecify the native L form of the amino acid. In this nomenclature, theinclusion of the superscript number designates the position of the aminoacid in the insulin analog sequence, wherein amino acids that arelocated within the insulin analog sequence are designated by positivesuperscript numbers numbered consecutively from the N-terminus.Additional amino acids linked to the insulin analog peptide either atthe N-terminus or through a side chain are numbered starting with 0 andincreasing in negative integer value as they are further removed fromthe insulin analog sequence. For example, the position of an amino acidwithin a dipeptide prodrug linked to the N-terminus of an insulin analogis designated aa⁻¹-aa⁰-insulin analog, wherein aa⁰ represents thecarboxy terminal amino acid of the dipeptide and aa⁻¹ designates theamino terminal amino acid of the dipeptide.

As used herein the term “hydroxyl acid” refers to amino acids that havebeen modified to replace the alpha carbon amino group with a hydroxylgroup.

As used herein the term “non-coded amino acid” encompasses any aminoacid that is not an L-isomer of any of the following 20 amino acids:Ala, Cys, Asp, Glu, Phe, Gly, His, Ile, Lys, Leu, Met, Asn, Pro, Gln,Arg, Ser, Thr, Val, Trp, Tyr.

A “dipeptide” is a compound formed by linkage of an alpha amino acid oran alpha hydroxyl acid to another amino acid, through a peptide bond.

As used herein the term “chemical cleavage” absent any furtherdesignation encompasses a non-enzymatic reaction that results in thebreakage of a covalent chemical bond.

A “bioactive polypeptide” refers to polypeptides which are capable ofexerting a biological effect in vitro and/or in vivo.

As used herein a general reference to a peptide is intended to encompasspeptides that have modified amino and carboxy termini. For example, anamino acid sequence designating the standard amino acids is intended toencompass standard amino acids at the N- and C-terminus as well as acorresponding hydroxyl acid at the N-terminus and/or a correspondingC-terminal amino acid modified to comprise an amide group in place ofthe terminal carboxylic acid.

As used herein an “acylated” amino acid is an amino acid comprising anacyl group which is non-native to a naturally-occurring amino acid,regardless by the means by which it is produced. Exemplary methods ofproducing acylated amino acids and acylated peptides are known in theart and include acylating an amino acid before inclusion in the peptideor peptide synthesis followed by chemical acylation of the peptide. Insome embodiments, the acyl group causes the peptide to have one or moreof (i) a prolonged half-life in circulation, (ii) a delayed onset ofaction, (iii) an extended duration of action, (iv) an improvedresistance to proteases, such as DPP-IV, and (v) increased potency atthe IGF and/or insulin peptide receptors.

As used herein, an “alkylated” amino acid is an amino acid comprising analkyl group which is non-native to a naturally-occurring amino acid,regardless of the means by which it is produced. Exemplary methods ofproducing alkylated amino acids and alkylated peptides are known in theart and including alkylating an amino acid before inclusion in thepeptide or peptide synthesis followed by chemical alkylation of thepeptide. Without being held to any particular theory, it is believedthat alkylation of peptides will achieve similar, if not the same,effects as acylation of the peptides, e.g., a prolonged half-life incirculation, a delayed onset of action, an extended duration of action,an improved resistance to proteases, such as DPP-IV, and increasedpotency at the IGF and/or insulin receptors.

As used herein, the term “pharmaceutically acceptable carrier” includesany of the standard pharmaceutical carriers, such as a phosphatebuffered saline solution, water, emulsions such as an oil/water orwater/oil emulsion, and various types of wetting agents. The term alsoencompasses any of the agents approved by a regulatory agency of the USFederal government or listed in the US Pharmacopeia for use in animals,including humans.

As used herein the term “pharmaceutically acceptable salt” refers tosalts of compounds that retain the biological activity of the parentcompound, and which are not biologically or otherwise undesirable. Manyof the compounds disclosed herein are capable of forming acid and/orbase salts by virtue of the presence of amino and/or carboxyl groups orgroups similar thereto.

Pharmaceutically acceptable base addition salts can be prepared frominorganic and organic bases. Salts derived from inorganic bases, includeby way of example only, sodium, potassium, lithium, ammonium, calciumand magnesium salts. Salts derived from organic bases include, but arenot limited to, salts of primary, secondary and tertiary amines.

Pharmaceutically acceptable acid addition salts may be prepared frominorganic and organic acids. Salts derived from inorganic acids includehydrochloric acid, hydrobromic acid, sulfuric acid, nitric acid,phosphoric acid, and the like. Salts derived from organic acids includeacetic acid, propionic acid, glycolic acid, pyruvic acid, oxalic acid,malic acid, malonic acid, succinic acid, maleic acid, fumaric acid,tartaric acid, citric acid, benzoic acid, cinnamic acid, mandelic acid,methanesulfonic acid, ethanesulfonic acid, p-toluene-sulfonic acid,salicylic acid, and the like.

As used herein, the term “hydrophilic moiety” refers to any compoundthat is readily water-soluble or readily absorbs water, and which aretolerated in vivo by mammalian species without toxic effects (i.e. arebiocompatible). Examples of hydrophilic moieties include polyethyleneglycol (PEG), polylactic acid, polyglycolic acid, apolylactic-polyglycolic acid copolymer, polyvinyl alcohol,polyvinylpyrrolidone, polymethoxazoline, polyethyloxazoline,polyhydroxyethyl methacrylate, polyhydroxypropyl methacrylamide,polymethacrylamide, polydimethylacrylamide, and derivatised cellulosessuch as hydroxymethylcellulose or hydroxyethylcellulose and co-polymersthereof, as well as natural polymers including, for example, albumin,heparin and dextran.

As used herein, the term “treating” includes prophylaxis of the specificdisorder or condition, or alleviation of the symptoms associated with aspecific disorder or condition and/or preventing or eliminating saidsymptoms. For example, as used herein the term “treating diabetes” willrefer in general to maintaining glucose blood levels near normal levelsand may include increasing or decreasing blood glucose levels dependingon a given situation.

As used herein an “effective” amount or a “therapeutically effectiveamount” of an insulin analog refers to a nontoxic but sufficient amountof an insulin analog to provide the desired effect. For example onedesired effect would be the prevention or treatment of hyperglycemia.The amount that is “effective” will vary from subject to subject,depending on the age and general condition of the individual, mode ofadministration, and the like. Thus, it is not always possible to specifyan exact “effective amount.” However, an appropriate “effective” amountin any individual case may be determined by one of ordinary skill in theart using routine experimentation.

The term, “parenteral” means not through the alimentary canal but bysome other route such as intranasal, inhalation, subcutaneous,intramuscular, intraspinal, or intravenous.

Throughout the application, all references to a particular amino acidposition by letter and number (e.g. position A5) refer to the amino acidat that position of either the A chain (e.g. position A5) or the B chain(e.g. position B5) in the respective native human insulin A chain (SEQID NO: 1) or B chain (SEQ ID NO: 2), or the corresponding amino acidposition in any analogs thereof. For example, a reference herein to“position B28” absent any further elaboration would mean thecorresponding position B27 of the B chain of an insulin analog in whichthe first amino acid of SEQ ID NO: 2 has been deleted. Similarly, aminoacids added to the N-terminus of the native B chain are numberedstarting with B0, followed by numbers of increasing negative value(e.g., B−1, B−2 . . . ) as amino acids are added to the N-terminus.Alternatively, any reference to an amino acid position in the linkingmoiety of a single chain analog, is made in reference to the native Cchain of IGF 1 (SEQ ID NO: 17). For example, position 9 of the native Cchain (or the “position C9”) has an alanine residue.

As used herein the term “native insulin peptide” is intended todesignate the 51 amino acid heteroduplex comprising the A chain of SEQID NO: 1 and the B chain of SEQ ID NO: 2, as well as single-chaininsulin analogs that comprise SEQ ID NOS: 1 and 2. The term “insulinpeptide” as used herein, absent further descriptive language is intendedto encompass the 51 amino acid heteroduplex comprising the A chain ofSEQ ID NO: 1 and the B chain of SEQ ID NO: 2, as well as single-chaininsulin analogs thereof (including for example those disclosed inpublished international application WO96/34882 and U.S. Pat. No.6,630,348, the disclosures of which are incorporated herein byreference), including heteroduplexes and single-chain analogs thatcomprise modified analogs of the native A chain and/or B chain andderivatives thereof. Such modified analogs include modification of theamino acid at position A19, B16 or B25 to a 4-amino phenylalanine or oneor more amino acid substitutions at positions selected from A5, A8, A9,A10, A12, A14, A15, A17, A18, A21, B1, B2, B3, B4, B5, B9, B10, B13,B14, B17, B20, B21, B22, B23, B26, B27, B28, B29 and B30 or deletions ofany or all of positions B1-4 and B26-30. Insulin peptides as definedherein can also be analogs derived from a naturally occurring insulin byinsertion or substitution of a non-peptide moiety, e.g. a retroinversofragment, or incorporation of non-peptide bonds such as an azapeptidebond (CO substituted by NH) or pseudo-peptide bond (e.g. NH substitutedwith CH₂) or an ester bond (e.g., a depsipeptide, wherein one or more ofthe amide (—CONHR—) bonds are replaced by ester (COOR) bonds).

An “A19 insulin analog” is an insulin peptide that has a substitution of4-amino phenylalanine or 4-methoxy phenylalanine for the native tyrosineresidue at position 19 of the A chain of native insulin.

As used herein an “IGF^(B16B17) analog peptide” is a generic term thatcomprising an A chain and B chain heteroduplex, as well as single-chaininsulin analogs thereof, wherein the A chain comprises the peptidesequence of SEQ ID NO: 19 and the B chain comprises the sequence of SEQID NO: 21 as well as analogs of those sequences wherein the analog ofthe A chain and/or B chain comprise 1-3 further amino acidsubstitutions, with the proviso that the B chain does not comprise thesequence of SEQ ID NO: 2 and comprises a tyrosine at position B16 and aleucine at position B17.

An “IGF YL analog” is a peptide comprising an IGF A chain of SEQ ID NO:19 and an IGF B chain of SEQ ID NO: 58.

As used herein, the term “single-chain insulin analog” encompasses agroup of structurally-related proteins wherein insulin or IGF A and Bchains, or analogs or derivatives thereof, are covalently linked to oneanother to form a linear polypeptide chain. As disclosed herein thesingle-chain insulin analog comprises the covalent linkage of thecarboxy terminus of the B chain to the amino terminus of the A chain viaa linking moiety.

As used herein the term “insulin A chain”, absent further descriptivelanguage is intended to encompass the 21 amino acid sequence of SEQ IDNO: 1 as well as functional analogs and derivatives thereof, includingthe A chain of A19 insulin analogs and other analogs known to thoseskilled in the art, including modification of the sequence of SEQ ID NO:1 by one or more amino acid insertions, deletions or substitutions atpositions selected from A4, A5, A8, A9, A10, A12, A14, A15, A17, A18,A21.

As used herein the term “insulin B chain”, absent further descriptivelanguage is intended to encompass the 30 amino acid sequence of SEQ IDNO: 2, as well as modified functional analogs of the native B chain,including modification of the amino acid at position B16 or B25 to a4-amino phenylalanine or one or more amino acid insertions, deletions orsubstitutions at positions selected from B1, B2, B3, B4, B5, B9, B10,B13, B14, B17, B20, B21, B22, B23, B25, B26, B27, B28, B29 and B30 ordeletions of any or all of positions B1-4 and B26-30.

As used herein the term “derivative” is intended to encompass chemicalmodification to a compound (e.g., an amino acid), including chemicalmodification in vitro, e.g. by introducing a group in a side chain inone or more positions of a polypeptide, e.g. a nitro group in a tyrosineresidue, or iodine in a tyrosine residue, or by conversion of a freecarboxylic group to an ester group or to an amide group, or byconverting an amino group to an amide by acylation, or by acylating ahydroxy group rendering an ester, or by alkylation of a primary aminerendering a secondary amine or linkage of a hydrophilic moiety to anamino acid side chain. Other derivatives are obtained by oxidation orreduction of the side-chains of the amino acid residues in thepolypeptide.

As used herein the term IGF A chain, absent further descriptive languageis intended to encompass the 21 amino acid sequence of native IGF 1 orIGF 2 (SEQ ID NOs: 5 and 7 respectively), as well as functional analogsthereof known to those skilled in the art, including modification of thesequence of SEQ ID NO: 5 and 7 by one or more amino acid substitutionsat positions selected from A5, A8, A9, A10, A12, A14, A15, A17, A18,A21.

As used herein the term “IGF YL B chain”, absent further descriptivelanguage is intended to encompass an amino acid sequence comprising SEQID NO: 21, including for example the sequence of SEQ ID NO: 168, as wellas analogs of the IGF YL B chain and derivatives thereof, includingmodification of the amino acid at position B16 or B25 to a 4-aminophenylalanine or one or more amino acid substitutions at positionsselected from B1, B2, B3, B4, B5, B9, B10, B13, B14, B17, B20, B21, B22,B23, B26, B27, B28, B29 and B30 or deletions of any or all of positionsB1-4 and B26-30.

The term “identity” as used herein relates to the similarity between twoor more sequences. Identity is measured by dividing the number ofidentical residues by the total number of residues and multiplying theproduct by 100 to achieve a percentage. Thus, two copies of exactly thesame sequence have 100% identity, whereas two sequences that have aminoacid deletions, additions, or substitutions relative to one another havea lower degree of identity. Those skilled in the art will recognize thatseveral computer programs, such as those that employ algorithms such asBLAST (Basic Local Alignment Search Tool, Altschul et al. (1993) J. Mol.Biol. 215:403-410) are available for determining sequence identity.

As used herein, the term “selectivity” of a molecule for a firstreceptor relative to a second receptor refers to the following ratio:EC₅₀ of the molecule at the second receptor divided by the EC₅₀ of themolecule at the first receptor. For example, a molecule that has an EC₅₀of 1 nM at a first receptor and an EC₅₀ of 100 nM at a second receptorhas 100-fold selectivity for the first receptor relative to the secondreceptor.

As used herein an amino acid “modification” refers to a substitution ofan amino acid, or the derivation of an amino acid by the addition and/orremoval of chemical groups to/from the amino acid, and includessubstitution with any of the 20 amino acids commonly found in humanproteins, as well as atypical or non-naturally occurring amino acids.Commercial sources of atypical amino acids include Sigma-Aldrich(Milwaukee, Wis.), ChemPep Inc. (Miami, Fla.), and GenzymePharmaceuticals (Cambridge, Mass.). Atypical amino acids may bepurchased from commercial suppliers, synthesized de novo, or chemicallymodified or derivatized from naturally occurring amino acids.

As used herein an amino acid “substitution” refers to the replacement ofone amino acid residue by a different amino acid residue.

As used herein, the term “conservative amino acid substitution” isdefined herein as exchanges within one of the following five groups:

I. Small aliphatic, nonpolar or slightly polar residues:

-   -   Ala, Ser, Thr, Pro, Gly;

II. Polar, negatively charged residues and their amides:

-   -   Asp, Asn, Glu, Gln, cysteic acid and homocysteic acid;

III. Polar, positively charged residues:

-   -   His, Arg, Lys; Ornithine (Orn)

IV. Large, aliphatic, nonpolar residues:

-   -   Met, Leu, Ile, Val, Cys, Norleucine (Nle), homocysteine

V. Large, aromatic residues:

-   -   Phe, Tyr, Trp, acetyl phenylalanine

As used herein the general term “polyethylene glycol chain” or “PEGchain”, refers to mixtures of condensation polymers of ethylene oxideand water, in a branched or straight chain, represented by the generalformula H(OCH₂CH₂)_(n)OH, wherein n is at least 2. “Polyethylene glycolchain” or “PEG chain” is used in combination with a numeric suffix toindicate the approximate average molecular weight thereof. For example,PEG-5,000 refers to polyethylene glycol chain having a total molecularweight average of about 5,000 Daltons.

As used herein the term “pegylated” and like terms refers to a compoundthat has been modified from its native state by linking a polyethyleneglycol chain to the compound. A “pegylated polypeptide” is a polypeptidethat has a PEG chain covalently bound to the polypeptide.

As used herein an “acylated” amino acid is an amino acid comprising anacyl group which is non-native to a naturally-occurring amino acid,regardless by the means by which it is produced. Exemplary methods ofproducing acylated amino acids and acylated peptides are known in theart and include acylating an amino acid before inclusion in the peptideor peptide synthesis followed by chemical acylation of the peptide. Insome embodiments, the acyl group causes the peptide to have one or moreof (i) a prolonged half-life in circulation, (ii) a delayed onset ofaction, (iii) an extended duration of action, (iv) an improvedresistance to proteases, and (v) increased potency at the insulinreceptor.

As used herein, an “alkylated” amino acid is an amino acid comprising analkyl group which is non-native to a naturally-occurring amino acid,regardless of the means by which it is produced. Exemplary methods ofproducing alkylated amino acids and alkylated peptides are known in theart and including alkylating an amino acid before inclusion in thepeptide or peptide synthesis followed by chemical alkylation of thepeptide. Without being held to any particular theory, it is believedthat alkylation of peptides will achieve similar, if not the same,effects as acylation of the peptides, e.g., a prolonged half-life incirculation, a delayed onset of action, an extended duration of action,an improved resistance to proteases, and increased potency at theinsulin receptor.

As used herein a “linker” is a bond, molecule or group of molecules thatbinds two separate entities to one another. Linkers may provide foroptimal spacing of the two entities or may further supply a labilelinkage that allows the two entities to be separated from each other.Labile linkages include photocleavable groups, acid-labile moieties,base-labile moieties and enzyme-cleavable groups.

As used herein an “IGF dimer” is a complex comprising two IGF YL analogpeptides (each itself comprising an A chain and a B chain) covalentlybound to one another via a linker. The term IGF dimer, when used absentany qualifying language, encompasses both IGF homodimers and IGFheterodimers. An IGF homodimer comprises two identical subunits, whereasan IGF heterodimer comprises two subunits that differ, although the twosubunits are substantially similar to one another.

The term “C₁-C_(n) alkyl” wherein n can be from 1 through 6, as usedherein, represents a branched or linear alkyl group having from one tothe specified number of carbon atoms. Typical C₁-C₆ alkyl groupsinclude, but are not limited to, methyl, ethyl, n-propyl, iso-propyl,butyl, iso-Butyl, sec-butyl, tert-butyl, pentyl, hexyl and the like.

The terms “C₂-C_(n) alkenyl” wherein n can be from 2 through 6, as usedherein, represents an olefinically unsaturated branched or linear grouphaving from 2 to the specified number of carbon atoms and at least onedouble bond. Examples of such groups include, but are not limited to,1-propenyl, 2-propenyl (—CH₂—CH═CH₂), 1,3-butadienyl, (—CH═CHCH═CH₂),1-butenyl (—CH═CHCH₂CH₃), hexenyl, pentenyl, and the like.

The term “C₂-C_(n) alkynyl” wherein n can be from 2 to 6, refers to anunsaturated branched or linear group having from 2 to n carbon atoms andat least one triple bond. Examples of such groups include, but are notlimited to, 1-propynyl, 2-propynyl, 1-butynyl, 2-butynyl, 1-pentynyl,and the like.

As used herein the term “aryl” refers to a mono- or bicyclic carbocyclicring system having one or two aromatic rings including, but not limitedto, phenyl, naphthyl, tetrahydronaphthyl, indanyl, indenyl, and thelike. The size of the aryl ring and the presence of substituents orlinking groups are indicated by designating the number of carbonspresent. For example, the term “(C₁-C₃ alkyl)(C₆-C₁₀ aryl)” refers to a5 to 10 membered aryl that is attached to a parent moiety via a one tothree membered alkyl chain.

The term “heteroaryl” as used herein refers to a mono- or bi-cyclic ringsystem containing one or two aromatic rings and containing at least onenitrogen, oxygen, or sulfur atom in an aromatic ring. The size of theheteroaryl ring and the presence of substituents or linking groups areindicated by designating the number of carbons present. For example, theterm “(C₁-C_(n) alkyl)(C₅-C₆ heteroaryl)” refers to a 5 or 6 memberedheteroaryl that is attached to a parent moiety via a one to “n” memberedalkyl chain.

As used herein, the term “halo” refers to one or more members of thegroup consisting of fluorine, chlorine, bromine, and iodine.

As used herein the term “patient” without further designation isintended to encompass any warm blooded vertebrate domesticated animal(including for example, but not limited to livestock, horses, cats, dogsand other pets) and humans.

The term “isolated” as used herein means having been removed from itsnatural environment. In some embodiments, the analog is made throughrecombinant methods and the analog is isolated from the host cell.

The term “purified,” as used herein relates to the isolation of amolecule or compound in a form that is substantially free ofcontaminants normally associated with the molecule or compound in anative or natural environment and means having been increased in purityas a result of being separated from other components of the originalcomposition. The term “purified polypeptide” is used herein to describea polypeptide which has been separated from other compounds including,but not limited to nucleic acid molecules, lipids and carbohydrates.

A “peptidomimetic” refers to a chemical compound having a structure thatis different from the general structure of an existing peptide, but thatfunctions in a manner similar to the existing peptide, e.g., bymimicking the biological activity of that peptide. Peptidomimeticstypically comprise naturally-occurring amino acids and/or unnaturalamino acids, but can also comprise modifications to the peptidebackbone. For example a peptidomimetic may include a sequence ofnaturally-occurring amino acids with the insertion or substitution of anon-peptide moiety, e.g. a retroinverso fragment, or incorporation ofnon-peptide bonds such as an azapeptide bond (CO substituted by NH) orpseudo-peptide bond (e.g. NH substituted with CH₂), or an ester bond(e.g., depsipeptides, wherein one or more of the amide (—CONHR—) bondsare replaced by ester (COOR) bonds). Alternatively the peptidomimeticmay be devoid of any naturally-occurring amino acids.

As used herein the term “charged amino acid” or “charged residue” refersto an amino acid that comprises a side chain that is negatively charged(i.e., de-protonated) or positively charged (i.e., protonated) inaqueous solution at physiological pH. For example, negatively chargedamino acids include aspartic acid, glutamic acid, cysteic acid,homocysteic acid, and homoglutamic acid, whereas positively chargedamino acids include arginine, lysine and histidine. Charged amino acidsinclude the charged amino acids among the 20 amino acids commonly foundin human proteins, as well as atypical or non-naturally occurring aminoacids.

As used herein the term “acidic amino acid” refers to an amino acid thatcomprises a second acidic moiety (other than the alpha carboxylic acidof the amino acid), including for example, a side chain carboxylic acidor sulfonic acid group.

ABBREVIATIONS

Insulin analogs will be abbreviated as follows:

The insulin A and B chains will be designated by a capital A for the Achain and a capital B for the B chain wherein a superscript 0 (e.g., A⁰or B⁰ will designate the base sequence is an insulin sequence (A chain:SEQ ID NO: 1, B chain SEQ ID NO: 2) and a superscript 1 (e.g., A¹ or B¹)will designate the base sequence is an IGF-1 sequence (A chain: SEQ IDNO: 5, B chain SEQ ID NO: 6). Modifications that deviate from the nativeinsulin and IGF sequence are indicated in parenthesis following thedesignation of the A or B chain (e.g., [B¹(H5,H10,Y16,L17):A¹(H8,N18,N21)]) with the single letter amino acid abbreviationindicating the substitution and the number indicating the position ofthe substitution in the respective A or B chain, using native insulinnumbering. A colon between the A and B chain indicates a two chaininsulin whereas a dash will indicate a covalent bond and thus a singlechain analog. In single chain analogs a linking moiety will be includedbetween the A and B chains and the designation C¹ refers to the nativeIGF 1 C peptide, SEQ ID NO: 17. The designation “position C8” inreference to the linking moiety designates an amino acid located at theposition corresponding to the eighth amino acid of SEQ ID NO: 17.

Embodiments

As disclosed herein applicants have discovered high potency single chaininsulin analogs. More particularly, applicants have discovered uniquelinking moieties that can be used to covalently link the B chain and Achain of human insulin, or analogs or derivatives thereof to form a highpotency linear single chain insulin agonists. In one embodiment thelinking moiety covalently bonds the carboxy terminus of the B chain tothe amino terminus of the A chain.

As disclosed herein optimally sized linking moieties can be used to linkhuman insulin A and B chains, or analogs or derivatives thereof, whereinthe carboxy terminus of the B25 amino acid of the B chain is directlylinked to a first end of a linking moiety, wherein the second end of thelinking moiety is directly linked to the amino terminus of the A1 aminoacid of the A chain via the intervening linking moiety. In oneembodiment the linking moiety comprises an 8 to 17 amino acid peptide,and more particularly, in one embodiment the peptide represents ananalog of the IGF-1 C peptide. In another embodiment the linking moietycomprises a relatively short bifunctional non-peptide polymer linkerthat approximates the length of an 8-16 amino acid sequence. Inaccordance with one embodiment the non-peptide linking moiety is apolyethylene glycol linker of approximately 4 to 20, 8 to 18, 8 to 16, 8to 14, 8 to 12, 10 to 14, 10 to 12 or 11 to 13 monomers.

In one embodiment a single chain insulin agonist analog is provided thatcomprises the general structure B-LM-A wherein B represents an insulin Bchain, A represents an insulin A chain, and LM represents a linkingmoiety linking the carboxy terminus of the B chain to the amino terminusof the A chain. The insulin A and B chains can be any known insulinsequence, including those disclosed herein, that when linked together asa heteroduplex form a functional insulin. Applicants have discovered avariety of linking moieties as disclosed herein that can be used to linkthe insulin A and B chain together to generate an active single chaininsulin analog. In accordance with one embodiment the linking moietyfurther comprises a hydrophilic moiety linked to the side chain of anamino acid of the linking moiety and/or at a position selected from thegroup consisting of A9, A14 and A15 of the A chain or at the N-terminalalpha amine (positions B1, B2) or the side chain of an amino acid atpositions B10, B22, B28 or B29 of the B chain. In one embodiment thehydrophilic moiety is a polyethylene chain that is linked to an aminoacid of the linking moiety and/or at the N-terminal alpha amine of the Bchain. In one embodiment the linking moiety (LM) comprises an amino acidsequence of no more than 17 amino acids in length and comprising thesequence X₅₁X₅₂X₅₃X₅₄X₅₅X₅₆X₅₇X₅₈ (SEQ ID NO: 9), wherein

X₅₁ is selected from the group consisting of glycine, alanine, valine,leucine, isoleucine, proline, phenylalanine and methionine;

X₅₂ is any amino acid other than tyrosine;

X₅₃ through X₅₆ are each independently any amino acid; and

X₅₇ and X₅₈ are independently arginine, lysine, cysteine, homocysteine,acetyl-phenylalanine or ornithine. In one embodiment the linking moietyfurther comprises a hydrophilic moiety linked to the side chain of anamino acid of the linking moiety. In one embodiment the linking moietycomprises the sequence X₅₁X₅₂GSSSX₅₇X₅₈ (SEQ ID NO: 29) orX₅₁X₅₂GSSSX₅₇X₅₈APQT (SEQ ID NO: 46) wherein X₅₁ is selected from thegroup consisting of glycine, alanine, valine, leucine, isoleucine andproline, X₅₂ is alanine, valine, leucine, isoleucine or proline and X₅₇or X₅₈ are independently arginine, lysine, cysteine, homocysteine,acetyl-phenylalanine or ornithine, wherein a hydrophilic moiety islinked to the side chain of the amino acid at position 7 or 8 of thelinking moiety (i.e., at the X₅₇ or X₅₈ position). Amino acid positionsof the linking moiety are designated based on the corresponding positionin the native C chain of IGF 1 (SEQ ID NO: 17).

In accordance with one embodiment a single chain insulin agonistpolypeptide is provided comprising a B chain and A chain of humaninsulin, or analogs or derivative thereof, wherein the last five carboxyamino acids of the native B chain are deleted (i.e., B26-B30), and aminoacid B25 is linked to amino acid A1 of the A chain via an interveninglinking moiety. In one embodiment the linking moiety has the generalstructure:Y₁—Z

wherein

Y₁ is selected from the group consisting of a bond, X₄₆, X₄₆X₄₇,X₄₆X₄₇X₄₈, X₄₆X₄₇X₄₈X₄₉ (SEQ ID NO: 24) and X₄₆X₄₇X₄₈X₄₉X₅₀ (SEQ ID NO:13) wherein each of X₄₆, X₄₇, X₄₈, X₄₉ and X₅₀ represent any amino acidor amino acid analog or derivative thereof; and

Z represents an amino acid sequence at least 8 amino acids and no morethan 16 amino acid in length and comprising the sequenceX₅₁X₅₂X₅₃X₅₄X₅₅X₅₆RR (SEQ ID NO: 10), wherein

X₅₁ is selected from the group consisting of glycine, alanine, valine,leucine, isoleucine, proline, phenylalanine and methionine; and

X₅₂ through X₅₆ are each independently any amino acid or amino acidanalog or derivative thereof. In one embodiment X₅₂ is any amino acidother than tyrosine. In a further embodiment X₅₂ is any non-aromaticamino acid, and in one embodiment X₅₂ is alanine, valine, leucine,isoleucine or proline. In a further embodiment X₅₁ is selected from thegroup consisting of glycine, alanine, valine, leucine, isoleucine,proline, phenylalanine and methionine; X₅₂ is any amino acid other thantyrosine; and

X₅₃, X₅₄, X₅₅ and X₅₆ are independently selected from the groupconsisting of glycine, alanine, serine, threonine and proline. In onefurther embodiment, X₅₁ is selected from the group consisting ofglycine, alanine, valine, leucine, isoleucine, proline, phenylalanineand methionine; X₅₂ is selected from the group consisting of glycine,alanine, valine, leucine, isoleucine, serine, threonine and proline; X₅₃is other than glycine; X₅₄, and X₅₅ are independently selected from thegroup consisting of glycine, alanine, serine, threonine and proline andX₅₆ is serine.

In another embodiment the linking moiety comprises the generalstructure:Y₁-W

wherein

Y₁ is selected from the group consisting of a bond, X₄₆, X₄₆X₄₇,X₄₆X₄₇X₄₈, and X₄₆X₄₇X₄₈X₄₉ (SEQ ID NO: 24) wherein each of X₄₆, X₄₇,X₄₈, and X₄₉ represent any amino acid or amino acid analog or derivativethereof, with the proviso that Y₁ is not YTPKT (SEQ ID NO: 16) or FNKPT(SEQ ID NO: 76); and W represents a polyethylene glycol of 2-16 monomerunits.

In one embodiment a single chain insulin analog is provided comprisingan A chain and a C-terminally truncated B chain, having amino acidsB26-B30 (relative to the native insulin sequence) removed, wherein saidA chain and B chain are human insulin sequences, or analogs orderivatives thereof, further wherein the carboxy terminus of the B25amino acid of the B chain is directly linked to a first end of a linkingmoiety and a second end of the linking moiety is directly linked to theamino terminus of the A1 amino acid of the A chain, further wherein, inone embodiment, the linking moiety does not comprise the sequence YTPKT(SEQ ID NO: 16) or FNKPT (SEQ ID NO: 76). In one embodiment theC-terminally truncated B chain comprises an analog of a peptiderepresenting amino acids 5-25 of SEQ ID NO: 2, wherein said analogdiffers from the corresponding amino acids 5-25 of SEQ ID NO: 2 by 1, 1to 2, 3 to 4, 4 to 6 or up to 8 amino acid substitutions at amino acidpositions selected from 5, 9, 10, 13, 14, 21, 22 and 25. In oneembodiment the C-terminally truncated B chain comprises an analog of apeptide representing amino acids 1-25 of SEQ ID NO: 2, wherein saidanalog differs from the corresponding amino acids 1-25 of SEQ ID NO: 2by 1, 1 to 2, 3 to 4, 4 to 6, 4 to 8 or up to 10 amino acidsubstitutions at amino acid positions selected from 2, 3, 4, 5, 9, 10,13, 14, 21, 22 and 25. In one embodiment the C-terminally truncated Bchain comprises a peptide having at least 70%, 75%, 80%, 90% or 95%sequence identity with the corresponding amino acids 5-25 of SEQ ID NO:2. In one embodiment the A chain is an analog of SEQ ID NO: 1 whereinthe analog differs from SEQ ID NO: 1 by 1, 1 to 2, 3 to 4, 4 to 8 or upto 10 amino acid substitutions at amino acid positions selected from 4,5, 8, 9, 10, 12, 14, 15, 18 and 21. In one embodiment the A chaincomprises a peptide having at least 70%, 75%, 80%, 90% or 95% sequenceidentity with SEQ ID NO: 1.

In one embodiment the linking moiety comprising

-   -   a) a polyethylene glycol of 6-16 monomer units;    -   b) a non-native amino acid sequence of at least 8 amino acids        and no more than 17 amino acid in length and comprising the        sequence GYGSSSX₅₇R (SEQ ID NO: 51) or X₅₁X₅₂X₅₃X₅₄X₅₅X₅₆X₅₇X₅₈        (SEQ ID NO: 9), or a peptidomimetic thereof; or    -   c) a combination of said polyethylene glycol and a non-native        amino acid sequence of 1 to 4 amino acids;

wherein

X₅₁ is selected from the group consisting of glycine, alanine, valine,leucine, isoleucine, proline, phenylalanine and methionine;

X₅₂ is any amino acid other than tyrosine;

X₅₃ through X₅₆ are each independently any amino acid; and

X₅₇ and X₅₈ are independently arginine, lysine or ornithine.

In one embodiment a single chain insulin agonist analog is providedcomprising the general structure B-LM-A wherein B represents an insulinB chain comprising the sequence X₂₅LCGX₂₉X₃₀LVX₃₃X₃₄LYLVCGDX₄₂GFX₄₅ (SEQID NO: 58), A represents an insulin A chain comprising the sequence

GIVX₄X₅CCX₈X₉X₁₀CX₁₂LX₁₄X₁₅LX₁₇X₁₈X₁₉CX₂₁—R₁₃ (SEQ ID NO: 22), and LMrepresents a peptide linking moiety linking the carboxy terminus of theB chain to the amino terminus of the A chain. In one embodiment thelinking moiety comprises an amino acid sequence of no more than 17 aminoacid in length and comprising the sequence X₅₁X₅₂X₅₃X₅₄X₅₅X₅₆X₅₇X₅₈ (SEQID NO: 9), wherein

X₄ is glutamic acid or aspartic acid;

X₅ is glutamine or glutamic acid

X₈ is histidine, threonine or phenylalanine;

X₉ is serine, arginine, lysine, ornithine or alanine;

X₁₀ is isoleucine or serine;

X₁₂ is serine or aspartic acid

X₁₄ is tyrosine, arginine, lysine, ornithine or alanine;

X₁₅ is glutamine, glutamic acid, arginine, alanine, lysine, ornithine orleucine;

X₁₇ is glutamine, glutamic acid, arginine, aspartic acid or lysine,ornithine

X₁₈ is methionine, asparagine, glutamine, aspartic acid, glutamic acidor threonine;

X₁₉ is tyrosine, 4-methoxy-phenylalanine or 4-amino phenylalanine;

X₂₁ is selected from the group consisting of alanine, glycine, serine,valine, threonine, isoleucine, leucine, glutamine, glutamic acid,asparagine, aspartic acid, histidine, tryptophan, tyrosine, andmethionine;

X₂₅ is selected from the group consisting of histidine and threonine;

X₂₉ is selected from the group consisting of alanine, glycine andserine;

X₃₀ is selected from the group consisting of histidine, aspartic acid,glutamic acid, homocysteic acid and cysteic acid;

X₃₃ is selected from the group consisting of aspartic acid and glutamicacid;

X₃₄ is selected from the group consisting of alanine and threonine;

X₄₂ is selected from the group consisting of alanine, ornithine andarginine;

X₄₅ is selected from the group consisting of tyrosine, histidine,asparagine and phenylalanine;

X₅₁ is selected from the group consisting of glycine, alanine, valine,leucine, isoleucine, proline, phenylalanine and methionine;

X₅₂ is any amino acid other than tyrosine;

X₅₃ through X₅₆ are each independently any amino acid; and

X₅₇ and X₅₈ are independently arginine, lysine, cysteine, homocysteine,acetyl-phenylalanine or ornithine. In one embodiment the single chainanalog further comprises a hydrophilic moiety covalently linked to theside chain of an amino acid of the linking moiety or at the N-terminalalpha amine of the B chain, or to the side chain of an amino acid at aposition selected from the group consisting of A9, A14 and A15 of the Achain or positions B1, B2, B10, B22, B28 or B29 of the B chain. In oneembodiment the hydrophilic moiety (e.g., PEG) is linked to theN-terminal alpha amine of the B chain. In one embodiment one to fiveamino acids corresponding to B26-B30 are removed from the B chaincarboxy terminus and the remaining carboxy terminal amino acid is linkeddirectly to the amino terminus of the linking moiety. In one embodimentthe B chain comprises the sequence

X₂₅LCGX₂₉X₃₀LVX₃₃X₃₄LYLVCGDX₄₂GFX₄₅ (SEQ ID NO: 58), the linking moietycomprises the sequence (Y₁)_(k)—X₅₁X₅₂X₅₃X₅₄X₅₅X₅₆X₅₇X₅₈(Y₂)_(n) (SEQ IDNO: 9), and the A chain comprises the sequenceGIVX₄ECCX₈X₉SCDLX₁₄X₁₅LEX₁₈X₁₉CX₂₁—R₁₃ (SEQ ID NO: 19).

The Linking Moiety

Peptide Linkers

In accordance with one embodiment the linking moiety is a derivative ofthe IGF 1 C chain sequence (GYGSSSRRAPQT; SEQ ID NO: 17). In oneembodiment the derivative is a peptide that differs from SEQ ID NO: 17by a single amino acid substitution of a lysine, cysteine ornithine,homocysteine, or acetyl-phenylalanine residue, and in a furtherembodiment the lysine, cysteine ornithine, homocysteine, oracetyl-phenylalanine amino acid is pegylated. In one further embodimentthe linking moiety is a peptide that differs from SEQ ID NO: 17 by asingle lysine substitution. In one specific embodiment the substitutionis made at position 8 of SEQ ID NO: 17. Applicants have discovered thatuse of the IGF 1 C chain sequence and analogs thereof as a linkingmoiety will generate a single chain insulin polypeptide that has nearwild type insulin activity. Furthermore, use of a IGF 1 C chain sequenceanalog as the linking moiety, wherein position 2 of the IGF 1 C chainsequence is modified, or the carboxy terminal four amino acids aredeleted from the IGF 1 C chain sequence, produces a single chain insulinpolypeptide that is selective for insulin (i.e., has a higher bindingand/or activity at the insulin receptor compared to the IGF-1 receptor).In one embodiment the single chain insulin polypeptide has 5×, 10×, 20×,30×, 40×, or 50× higher affinity or activity at the insulin receptorrelative to the IGF-1 receptor.

In accordance with one embodiment the linking moiety is a derivative ofthe IGF 1 C chain sequence (GYGSSSRRAPQT; SEQ ID NO: 17) and comprises anon-native sequence that differs from GYGSSSRR (SEQ ID NO: 18) orGAGSSSRRAPQT (SEQ ID NO: 36) by 1 to 3 amino acid substitutions, or 1 to2 amino acid substitutions. In one embodiment at least one of the aminoacid substitutions is a lysine or cysteine substitution, and in oneembodiment the amino acid substitutions are conservative amino acidsubstitutions. In one embodiment the linking moiety is a peptide (orpeptidomimetic) of 8 to 17 amino acids comprising a non-native aminoacid sequence that differs from GYGSSSRR (SEQ ID NO: 18) or GAGSSSRRAPQT(SEQ ID NO: 36) by 1 amino acid substitution, including for examplesubstitution with a lysine or cysteine. In one embodiment the linkingmoiety comprises the sequence GYGSSSRR (SEQ ID NO: 18) or GAGSSSRRAPQT(SEQ ID NO: 36). In one embodiment the linking moiety comprises thesequence GAGSSSRX₅₈APQT (SEQ ID NO: 167), GYGSSSX₅₇X₅₈APQT (SEQ ID NO:37), or an amino acid that differs from SEQ ID NO: 167 by a single aminoacid substitution, wherein X₅₇ is arginine and X₅₈ is arginine,ornithine or lysine, and in a further embodiment a polyethylene glycolchain is linked to the side chain of the amino acid at position 8 ofsaid linking moiety. In another embodiment the linking moiety comprisesthe sequence GX₅₂GSSSRX₅₈APQT (SEQ ID NO: 38), wherein X₅₂ is anynon-aromatic amino acid, including for example, alanine, valine,leucine, isoleucine or proline, and X₅₈ represents an amino acid thathas a polyethylene chain covalently linked to its side chain. In oneembodiment X₅₈ is a pegylated lysine. In one embodiment the linkingmoiety comprises the sequence GYGSSSRX₅₈ (SEQ ID NO: 45) orGAGSSSRX₅₈APQT (SEQ ID NO: 167), wherein X₅₈ represents an amino acidthat has a polyethylene chain covalently linked to its side chain.

In accordance with one embodiment the linking moiety is a peptide orpeptidomimetic of 6-18, 8-18, 8-17, 8-12, 8-10, 13-17 or 13-15 aminoacids (or amino acid analogs or derivatives thereof) wherein the peptidelinking moiety comprises two or more adjacent basic amino acid residues.In accordance with one embodiment the linking moiety is an 8 to 17non-native amino acid sequence comprising the sequenceX₅₁X₅₂X₅₃X₅₄X₅₅X₅₆X₅₇X₅₈ (SEQ ID NO: 9), wherein X₅₁, X₅₂, X₅₃, X₅₄, X₅₅and X₅₆ are independently any amino acid or amino acid analog orderivative thereof, and X₅₇ and X₅₈ are basic amino acids. In accordancewith one embodiment the linking moiety is an 8 to 12 non-native aminoacid sequence comprising the sequence X₅₁AX₅₃X₅₄X₅₅X₅₆X₅₇X₅₈ (SEQ ID NO:70) that joins a full length B chain to a full length A chain, whereinX₅₁, X₅₃, X₅₄, X₅₅ and X₅₆ are independently any amino acid or aminoacid analog or derivative thereof, and X₅₇ and X₅₈ are basic aminoacids. In one embodiment one of X₅₇ and X₅₈ represents a pegylatedlysine reside. In a further embodiment X₅₇ and X₅₈ are independentlyselected from the group consisting of arginine, lysine and ornithine.

In one embodiment the linking moiety is a peptide or peptidomimetic of8-12, 8-10, 13-17 or 13-15 amino acids and comprising the sequenceX₅₁X₅₂X₅₃X₅₄X₅₅X₅₆X₅₇R (SEQ ID NO: 25), wherein X₅₁, X₅₂, X₅₃, X₅₄ andX₅₅ are independently any amino acid or amino acid analog or derivativethereof and X₅₇ is arginine, lysine or ornithine. In one embodiment X₅₁is selected from the group consisting of glycine, alanine, valine,leucine, isoleucine, proline, phenylalanine and methionine and X₅₂ isany amino acid other than tyrosine. In one embodiment X₅₂ is anon-aromatic amino acid and in one specific embodiment X₅₂ is alanine orproline. In one embodiment the linking moiety is pegylated at the sidechain of an amino acid, and in a further embodiment the amino acid atposition 8 of the linking moiety is pegylated.

In another embodiment the linking moiety is a peptide or peptidomimeticof 8-12, 8-10, 13-17 or 13-15 amino acids and comprising the sequenceX₅₁X₅₂X₅₃X₅₄X₅₅SRR (SEQ ID NO: 26), wherein X₅₁, X₅₂, X₅₃, X₅₄ and X₅₅are independently any amino acid or amino acid analog or derivativethereof. In accordance with one embodiment

X₅₁ is selected from the group consisting of glycine, alanine, valine,leucine, isoleucine, proline, phenylalanine and methionine;

X₅₂ is any amino acid other than tyrosine;

X₅₃, X₅₄, X₅₅ and X₅₆ are independently selected from the groupconsisting of glycine, alanine, serine, threonine and proline. In oneembodiment, X₅₁ and X₅₂ are independently selected from the groupconsisting of glycine, alanine, valine, leucine, isoleucine, andproline.

In one embodiment the linking moiety is a non-native polypeptide of 8 to17 amino acids in length and comprising the sequenceX₅₁X₅₂X₅₃X₅₄X₅₅X₅₆RR (SEQ ID NO: 10), wherein X₅₂ is a non-aromaticamino acid, including for example alanine. In one embodiment the linkingmoiety is 8 to 17 amino acids in length and comprises the sequenceX₅₁X₅₂GSSSRR (SEQ ID NO: 27) wherein X₅₁ is selected from the groupconsisting of glycine, alanine, valine, leucine, isoleucine, proline andmethionine, and X₅₂ is a non-aromatic amino acid, including for examplealanine. In one embodiment the linking moiety is 8 to 17 amino acids inlength and comprises a sequence that differs from X₅₁X₅₂GSSSRR (SEQ IDNO: 27) by a single amino acid substitution wherein the amino acidsubstitution is an amino acid that is pegylated at its side chain,further wherein X₅₁ is selected from the group consisting of glycine,alanine, valine, leucine, isoleucine, proline and methionine, and X₅₂ isa non-aromatic amino acid, including for example alanine.

In one embodiment the linking moiety is an 8 to 17 amino acid sequencecomprising the sequence X₅₁AX₅₃X₅₄X₅₅X₅₆X₅₇R (SEQ ID NO: 28) or apeptidomimetic thereof, wherein X₅₁, X₅₃, X₅₄, X₅₅, X₅₆, and X₅₇ areindependently any amino acid or amino acid analog or derivative thereof.In one embodiment X₅₁ is selected from the group consisting of glycine,alanine, valine, leucine, isoleucine, proline, phenylalanine andmethionine; and X₅₃, X₅₄, X₅₅ and X₅₆ are independently selected fromthe group consisting of glycine, alanine, serine, threonine and proline,and X₅₇ is a basic amino acid, including for example, arginine, lysineor ornithine.

In one embodiment the linking moiety is an 8 to 17 amino acid sequencecomprising the sequence X₅₁X₅₂GSSSX₅₇X₅₈ (SEQ ID NO: 29) orX₅₁X₅₂GSSSX₅₇X₅₈APQT (SEQ ID NO: 46) wherein X₅₁ is selected from thegroup consisting of glycine, alanine, valine, leucine, isoleucine,proline, phenylalanine and methionine, X₅₂ is a non-aromatic amino acidand X₅₇ and X₅₈ are independently selected from the group consisting ofarginine, lysine and ornithine. In one embodiment the linking moietyfurther comprises a polyethylene glycol chain linked to the side chainof an amino acid of the linking moiety, including for example, the aminoacid at position 8 of the linking moiety. In a further embodiment thelinking moiety is an 8 to 17 amino acid sequence comprising the sequenceX₅₁X₅₂GSSSRR (SEQ ID NO: 27), a peptidomimetic of SEQ ID NO: 27, or anamino acid sequence that differs from SEQ ID NO: 27 by a single aminoacid at one of positions 3-8 of SEQ ID NO: 27, wherein X₅₁ is selectedfrom the group consisting of glycine, alanine, valine, leucine,isoleucine, proline and methionine, and X₅₂ is any amino acid, with theproviso that when the linking peptide is longer than 8 amino acids X₅₂is other than tyrosine. In one embodiment the linking moiety is an 8 to17 amino acid sequence consisting of the sequence X₅₁X₅₂GSSSRR (SEQ IDNO: 27), a peptidomimetic of SEQ ID NO: 27, or an amino acid sequencethat differs from SEQ ID NO: 27 by 1, 2, or 3 amino acid substitutionsat one of positions 3-8 of SEQ ID NO: 27, wherein X₅₁ is selected fromthe group consisting of glycine, alanine, valine, leucine, isoleucine,proline and methionine, and X₅₂ is any amino acid. In one embodiment thelinking moiety is a peptide of eight amino acids in length and comprisesthe sequence GYGSSSRR (SEQ ID NO: 18), or an amino acid sequence thatdiffers from SEQ ID NO: 18 by a single amino acid substitution, or aderivative thereof.

In one embodiment the linking moiety is at least 8 but no more than 17amino acids in length and comprises the sequence(Y₁)_(k)—X₅₁X₅₂X₅₃X₅₄X₅₅X₅₆X₅₇X₅₈(Y₂) (SEQ ID NO: 9), wherein

X₅₁ is selected from the group consisting of glycine, alanine, valine,leucine, isoleucine and proline;

X₅₂ is alanine, valine, leucine, isoleucine or proline; and

X₅₃ through X₅₆ are each independently any amino acid; and

X₅₇ and X₅₈ are independently arginine, lysine, cysteine, homocysteine,acetyl-phenylalanine or ornithine;

k is 0 or 1;

Y₁ is selected from the group X₄₆, X₄₆X₄₇, X₄₆X₄₇X₄₈, X₄₆X₄₇X₄₈X₄₉ (SEQID NO: 24) and X₄₆X₄₇X₄₈X₄₉X₅₀ (SEQ ID NO: 13);

Y₂ is selected from the group X₇₀, X₇₀X₇₁, X₇₀X₇₁X₇₂ and X₇₀X₇₁X₇₂X₇₃(SEQ ID NO: 15);

X₄₆ through X₅₀ and X₇₀ through X₇₃ are each independently any aminoacid. In one embodiment the linking peptide is pegylated. In oneembodiment k is 0, and Y₂ is X₇₀X₇₁X₇₂X₇₃ (SEQ ID NO: 15). In analternative embodiment k is 1 and Y₂ is X₇₀X₇₁X₇₂X₇₃ (SEQ ID NO: 15). Inone embodiment

X₄₆ is phenylalanine or tyrosine;

X₄₇ is asparagine or threonine;

X₄₈ is an aspartate-lysine dipeptide, an arginine-proline dipeptide, alysine-proline dipeptide, or a proline-lysine dipeptide; and

X₄₉ is threonine. In one embodiment X₇₀ is alanine, X₇₁ is proline, X₇₂is glutamine and

X₇₃ is threonine. In one embodiment k is 0, and Y₂ is APQT (SEQ ID NO:82). In one embodiment, Y₁ is selected from the group consisting of F,Y, FN, YT, FD, FE, YD, and YE. In one embodiment when the insulin Bchain is not native insulin (SEQ ID NO: 2) or native IGF-1 (SEQ ID NO:6), Y₁ is FNKPT (SEQ ID NO: 76) or FNPKT SEQ ID NO: 81).

In one embodiment the linking moiety is a 12 amino acid sequenceconsisting of the sequence X₅₁AGSSSRRAPQT (SEQ ID NO: 30), or an aminoacid sequence that differs from SEQ ID NO: 30 by a one to three aminoacid substitutions at positions selected from positions 3-12 of SEQ IDNO: 30, wherein X₅₁ is selected from the group consisting of glycine,alanine, valine, leucine, isoleucine, proline and methionine, or apeptidomimetic of SEQ ID NO: 30. In one embodiment the linking moiety isa 12 amino acid sequence consisting of the sequence X₅₁AGSSSRRAPQT (SEQID NO: 30), or an amino acid sequence that differs from SEQ ID NO: 30 bya single lysine or cysteine amino acid substitution at a positionselected from positions 3-10 of SEQ ID NO: 30, wherein X₅₁ is selectedfrom the group consisting of glycine, alanine, valine, leucine,isoleucine, proline and methionine, or a peptidomimetic of SEQ ID NO:30. In one embodiment the lysine or cysteine amino acid substitution isa pegylated lysine or cysteine amino acid.

In another embodiment, the linking moiety is an 8 to 17 amino acidsequence comprising the sequence GX₅₂GSSSRR (SEQ ID NO: 31), wherein X₅₂is any amino acid, a peptidomimetic of SEQ ID NO: 31, or an analogthereof that differs from SEQ ID NO: 31 by a single amino acidsubstitution at any of positions 1, 3, 4, 5, 6, 7 or 8 of SEQ ID NO: 31,with the proviso that when the linking peptide is longer than 8 aminoacids X₅₂ is other than tyrosine. In accordance with one embodiment thelinking moiety comprises an 8-17 amino acid sequence selected from thegroup consisting of GYGSSSRR (SEQ ID NO: 18), GAGSSSRR (SEQ ID NO: 32),GAGSSSRRA (SEQ ID NO: 33), GAGSSSRRAP (SEQ ID NO: 34), GAGSSSRRAPQ (SEQID NO: 35), GAGSSSRRAPQT (SEQ ID NO: 36), PYGSSSRR (SEQ ID NO: 39),PAGSSSRR (SEQ ID NO: 40), PAGSSSRRA (SEQ ID NO: 41), PAGSSSRRAP (SEQ IDNO: 42), PAGSSSRRAPQ (SEQ ID NO: 43), PAGSSSRRAPQT (SEQ ID NO: 44). Inaccordance with one embodiment the linking moiety comprises an aminoacid sequence that differs from GYGSSSRR (SEQ ID NO: 18), GAGSSSRR (SEQID NO: 32), GAGSSSRRA (SEQ ID NO: 33), GAGSSSRRAP (SEQ ID NO: 34),GAGSSSRRAPQ (SEQ ID NO: 35), GAGSSSRRAPQT (SEQ ID NO: 36), PYGSSSRR (SEQID NO: 39), PAGSSSRR (SEQ ID NO: 40), PAGSSSRRA (SEQ ID NO: 41),PAGSSSRRAP (SEQ ID NO: 42), PAGSSSRRAPQ (SEQ ID NO: 43), PAGSSSRRAPQT(SEQ ID NO: 44) by a single pegylated amino acid including for example apegylated lysine or pegylated cysteine amino acid substitution. In oneembodiment the pegylated amino acid is at position 8 of the linkingmoiety

Non-Peptide Linkers

In one embodiment the linking moiety is a relatively short bifunctionalnon-peptide polymer linker that approximates the length of an 8-16 aminoacid sequence. In accordance with one embodiment the non-peptide linkingmoiety is a polyethylene glycol linker of approximately 4 to 20, 8 to18, 8 to 16, 8 to 14, 10 to 14, 10 to 12 or 11 to 13 monomers. In oneembodiment a single chain insulin agonist is provided wherein the lastfive carboxy amino acids of the native B chain are deleted, and aminoacid B25 is directly linked to the linking moiety by a covalent bond.The second end of the linking moiety is covalently bound to amino acidA1 of the A chain thus linking the B and A chain via the linking moiety.In one embodiment the linking moiety is a linear polyethylene glycollinking moiety comprising at least 10 but no more than 16 monomer unitsand in another embodiment the polyethylene glycol linking moietycomprises at least 12 but no more than 16 monomer units, and in afurther embodiment the polyethylene glycol linking moiety comprises atleast 10 but no more than 14 monomer units.

In accordance with one embodiment the polyethylene glycol linking moietycomprises the structure:

wherein m is an integer ranging from 6 to 18, 8 to 16, 10 to 14 or 11 to13. In one embodiment m is an integer selected from 10, 11, 12, 13 or14. In one embodiment m is 12.

In one embodiment a single chain insulin agonist is provided wherein thelast five carboxy amino acids of the native B chain are deleted, andamino acid B25 is linked to amino acid A1 of the A chain via a linkingmoiety comprising polyethylene glycol of at least 8 but no more than 16monomer units and an amino acid sequence of one to four amino acids. Inaccordance with one embodiment the linking moiety comprises a 1-4 aminoacid sequence and a linear polyethylene glycol of at least 8 but lessthan 14 monomer units in length covalently bound to said 1-4 amino acidsequence, with the proviso that the amino acid sequence is not YTPK (SEQID NO: 78) or FNKP (SEQ ID NO: 77). In another embodiment a single chaininsulin agonist is provided wherein the last five carboxy amino acids ofthe native B chain are deleted, and amino acid B25 is linked to aminoacid A1 of the A chain via a linking moiety comprising a polyethyleneglycol of at least 8 but less than 14 monomer units in length and a 2-5amino acid sequence. The 2-5 amino acid sequence can be located betweenthe B chain and the polyethylene glycol chain or between the A chain andthe polyethylene glycol chain. However, when the 2-5 amino acid sequenceis located between the B chain and the polyethylene glycol chain, theamino acid sequence is not YTPKT (SEQ ID NO: 16) or FNKPT (SEQ ID NO:76).

In one embodiment the linking moiety comprises the general structure:W₁-Z₁—Y₁

wherein

W₁ and Y₁ are independently a bond, X₄₆, X₄₆X₄₇, X₄₆X₄₇X₄₈, X₄₆X₄₇X₄₈X₄₉(SEQ ID NO: 24) or X₄₆X₄₇X₄₈X₄₉X₅₀ (SEQ ID NO: 13), with the provisothat W₁ is not YTPK (SEQ ID NO: 78) or FNKP (SEQ ID NO: 77) and Z₁represents a polyethylene glycol of the general structure

wherein m is an integer ranging from 6-14, and each of X₄₆, X₄₇, X₄₈,X₄₉ and X₅₀ are independently any amino acid. In one embodiment X₄₆,X₄₇, X₄₈, X₄₉ and X₅₀ are independently any non-native amino acidrelative to positions B26-B30 of insulin or IGF-1. In one embodimentX₄₆, X₄₇, X₄₈, X₄₉ and X₅₀ are independently selected from the groupconsisting of glycine, alanine, valine, leucine, isoleucine, serine,threonine and proline, and in a further embodiment X₄₆, X₄₇, X₄₈, X₄₉and X₅₀ are independently selected from the group consisting of glycine,alanine, valine, leucine and isoleucine. In one embodiment, W₁ is a bondand Y₁ is X₄₆, X₄₆X₄₇ or X₄₆X₄₇X₄₈ (SEQ ID NO: 24) wherein X₄₆, X₄₇ andX₄₈ are each alanine and Z is a polyethylene glycol of 4-14 monomerunits. In one embodiment, Y₁ is a bond and W₁ is X₄₆, X₄₆X₄₇ orX₄₆X₄₇X₄₈ (SEQ ID NO: 24) wherein X₄₆, X₄₇ and X₄₈ are each alanine andZ is a polyethylene glycol of 4-14 monomer units.

In one embodiment the linking moiety comprises two polyethylene chainsseparated by 1, 2, 3 or 4 amino acids. In this embodiment the linkingmoiety comprises the general structure: W₂-Z₂—Y₂

wherein

W₂ and Y₂ are independently a polyethylene glycol of the generalstructure

and Z₂ is a bond, X₄₆, X₄₆X₄₇, or X₄₆X₄₇X₄₈, wherein m is an integerranging from 3-7 and each of X₄₆, X₄₇, and X₄₈ are independently anyamino acid. In one embodiment Z₂ is X₄₆ or X₄₆X₄₇, and in a furtherembodiment X₄₆ and X₄₇ are independently Lys or Cys. In one embodimentZ₂ comprises a pegylated Lys or Cys amino acid. In one embodiment thelinking moiety comprises a two polyethylene chains representing a totalof 8-12 or 10-14 or 12 monomeric units of ethylene glycol separated by asingle amino acid. In one embodiment the single amino acid is lysine orcysteine. In one embodiment Z₂ is a pegylated lysine.

In one embodiment a single chain insulin analog is provided comprisingan A chain and a C-terminally truncated B chain, having amino acidsB26-B30 (relative to the native insulin sequence) removed, wherein saidA chain and B chain are human insulin sequences, or analogs orderivatives thereof, further wherein the carboxy terminus of the B25amino acid of the B chain is directly linked to a first end of a linkingmoiety and a second end of the linking moiety is directly linked to theamino terminus of the A1 amino acid of the A chain. In one embodimentthe truncated B chain comprises the sequence of SEQ ID NO: 21 whereinthe B25 amino acid is directly linked to the N terminus of the linkingpeptide. In this embodiment the linking moiety comprises either

-   -   a) a polyethylene glycol of 6-16 monomer units;    -   b) a non-native amino acid sequence of at least 8 amino acids        and no more than 17 amino acid in length and comprising the        sequence (Y₁)_(k)—X₅₁AX₅₃X₅₄X₅₅X₅₆X₅₇R(Y₂)_(n) (SEQ ID NO: 28),        (Y₁)_(k)—X₅₁AX₅₃X₅₄X₅₅X₅₆RR(Y₂)_(n) (SEQ ID NO: 23),        (Y₁)_(k)-GYGSSSX₅₇X₅₈(Y₂)_(n) (SEQ ID NO: 85),        (Y₁)_(k)-GAGSSSX₅₇X₅₈(Y₂)_(n) (SEQ ID NO: 163),        (Y₁)_(k)-GYGSSSX₅₇R (SEQ ID NO: 51) or        (Y₁)_(k)—X₅₁X₅₂GSSSX₅₇X₅₈—(Y₂)_(n) (SEQ ID NO: 29); or    -   c) a combination of said polyethylene glycol and a non-native        amino acid sequence of 1 to 4 amino acids;

wherein

n is 0 or 1;

k is 0 or 1;

Y₁ is selected from the group X₄₆, X₄₆X₄₇; and

Y₂ is selected from the group X₇₀, X₇₀X₇₁, X₇₀X₇₁X₇₂ and X₇₀X₇₁X₇₂X₇₃(SEQ ID NO: 15);

X₅₁ is selected from the group consisting of glycine, alanine, valine,leucine, isoleucine, proline, phenylalanine and methionine;

X₅₂ is alanine, valine, leucine, isoleucine or proline;

X₅₇ and X₅₈ are independently arginine, lysine, cysteine, homocysteine,acetyl-phenylalanine or ornithine; and

X₄₆ is phenylalanine or tyrosine;

X₄₇ is asparagine, glutamic acid, aspartic acid or threonine;

X₇₀-X₇₃ are independently any amino acid, with the proviso that when kis 0, the linking peptide does not comprise the sequence YTPKT (SEQ IDNO: 16) or FNKPT (SEQ ID NO: 76). In one embodiment one of X₅₇ and X₅₈is linked to a hydrophilic moiety or is acylated. In one embodiment oneof X₅₇ and X₅₈ is pegylated. In one embodiment the linking moietycomprises the sequence (Y₁)_(k)—X₅₁AX₅₃X₅₄X₅₅X₅₆RR(Y₂)_(n) (SEQ ID NO:23) or (Y₁)_(k)-GYGSSSX₅₇R (SEQ ID NO: 51) wherein

k is 0 or 1;

n is 0 or 1;

X₅₁ is selected from the group consisting of glycine, alanine, valine,leucine, isoleucine, proline, phenylalanine and methionine;

X₅₃ through X₅₆ are each independently any amino acid;

X₅₇ is lysine, ornithine or arginine;

Y₁ is selected from the group X₄₆, X₄₆X₄₇, and X₄₆X₄₇X₄₈

-   -   wherein    -   X₄₆ is phenylalanine or tyrosine;    -   X₄₇ is asparagine, glutamic acid, aspartic acid or threonine;    -   X₄₈ is aspartic acid, arginine, lysine or proline; and

Y₂ is selected from the group consisting of A, AP, APQ and APQT (SEQ IDNO: 82).

In one embodiment the A chain is an amino acid sequence derivative of asequence selected from the group consisting of GIVEQCCTSICSLYQLENYCN(SEQ ID NO: 1), GIVDECCFRSCDLRRLEMYCA (SEQ ID NO: 5) orGIVEECCFRSCDLALLETYCA (SEQ ID NO: 7) and the B chain comprises thesequence FVNQHLCGSHLVEALYLVCGERGFFYTPKT (SEQ ID NO: 2), orGPETLCGX₂₆ELVDX₂₇LYLVCGDX₄₂GFYFNKPT-R₁₄ (SEQ ID NO: 197), wherein X₂₆and X₂₇ are each alanine and X₄₂ is arginine, or a carboxy shortenedsequence thereof having one to five amino acids corresponding to B26,B27, B28, B29 and B30 deleted

Insulin a and B Chains

The single chain insulin agonists of the present invention may comprisethe native B and A chain sequences of human insulin (SEQ ID NOs: 1 and2, respectively) or any of the known analogs or derivatives thereof thatexhibit insulin agonist activity when linked to one another in aheteroduplex. Such analogs include, for example, proteins that having anA-chain and a B-chain that differ from the A-chain and B-chain of humaninsulin by having one or more amino acid deletions, one or more aminoacid substitutions, and/or one or more amino acid insertions that do notdestroy the insulin activity of the insulin analog.

One type of insulin analog, “monomeric insulin analog,” is well known inthe art. These are fast-acting analogs of human insulin, including, forexample, insulin analogs wherein:

(a) the amino acyl residue at position B28 is substituted with Asp, Lys,Leu, Val, or Ala, and the amino acyl residue at position B29 is Lys orPro;

(b) the amino acyl residues at any of positions B27, B28, B29, and B30are deleted or substituted with a nonnative amino acid. In oneembodiment an insulin analog is provided comprising an Asp substitutedat position B28 or a Lys substituted at position 28 and a prolinesubstituted at position B29. Additional monomeric insulin analogs aredisclosed in Chance, et al., U.S. Pat. No. 5,514,646; Chance, et al.,U.S. patent application Ser. No. 08/255,297; Brems, et al., ProteinEngineering, 5:527-533 (1992); Brange, et al., EPO Publication No.214,826 (published Mar. 18, 1987); and Brange, et al., Current Opinionin Structural Biology, 1:934-940 (1991). These disclosures are expresslyincorporated herein by reference for describing monomeric insulinanalogs.

Insulin analogs may also have replacements of the amidated amino acidswith acidic forms. For example, Asn may be replaced with Asp or Glu.Likewise, Gln may be replaced with Asp or Glu. In particular, Asn(A18),Asn(A21), or Asp(B3), or any combination of those residues, may bereplaced by Asp or Glu. Also, Gln(A15) or Gln(B4), or both, may bereplaced by either Asp or Glu.

As disclosed herein single chain insulin agonists are providedcomprising a B chain and A chain of human insulin, or analogs orderivative thereof, wherein the carboxy terminus of the B chain islinked to the amino terminus of the A chain via a linking moiety. In oneembodiment the A chain is an amino acid sequence selected from the groupconsisting of GIVEQCCTSICSLYQLENYCN (SEQ ID NO: 1),GIVDECCFRSCDLRRLEMYCA (SEQ ID NO: 5) or GIVEECCFRSCDLALLETYCA (SEQ IDNO: 7) and the B chain comprises the sequenceFVNQHLCGSHLVEALYLVCGERGFFYTPKT (SEQ ID NO: 2),GPETLCGX₂₆ELVDX₂₇LYLVCGDX₄₂GFYFNKPT-R₁₄ (SEQ ID NO: 197), wherein X₂₆and X₂₇ are each alanine and X₄₂ is arginine, or a carboxy shortenedsequence thereof having one to five amino acids corresponding to B26,B27, B28, B29 and B30 deleted, and analogs of those sequences whereineach sequence is modified to comprise one to five amino acidsubstitutions at positions corresponding to native insulin positions(see peptide alignment shown in FIG. 5) selected from A5, A8, A9, A10,A14, A15, A17, A18, A21, B1, B2, B3, B4, B5, B9, B10, B13, B14, B20,B22, B23, B26, B27, B28, B29 and B30. In one embodiment the amino acidsubstitutions are conservative amino acid substitutions. Suitable aminoacid substitutions at these positions that do not adversely impactinsulin's desired activities are known to those skilled in the art, asdemonstrated, for example, in Mayer, et al., Insulin Structure andFunction, Biopolymers. 2007; 88(5):687-713, the disclosure of which isincorporated herein by reference.

In accordance with one embodiment the single chain insulin analogpeptides may comprise an insulin A chain and an insulin B chain oranalogs thereof, wherein the A chain comprises an amino acid sequencethat shares at least 70% sequence identity (e.g., 70%, 75%, 80%, 85%,90%, 95%) over the length of the native peptide, with at least one ofGIVEQCCTSICSLYQLENYCN (SEQ ID NO: 1), GIVDECCFRSCDLRRLEMYCA (SEQ ID NO:5) or GIVEECCFRSCDLALLETYCA (SEQ ID NO: 7) and the B chain comprises anamino acid sequence that shares at least 60% sequence identity (e.g.,60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%) over the length of the nativepeptide, with at least one of the sequenceFVNQHLCGSHLVEALYLVCGERGFFYTPKT (SEQ ID NO: 2),GPETLCGX₂₆ELVDX₂₇LYLVCGDX₄₂GFYFNKPT-R₁₄ (SEQ ID NO: 197), wherein X₂₆and X₂₇ are each alanine and X₄₂ is arginine, or a carboxy shortenedsequence thereof having one to four amino acids corresponding to B27,B28, B29 and B30 deleted.

Additional amino acid sequences can be added to the amino terminus ofthe B chain or to the carboxy terminus of the A chain of the singlechain insulin agonists of the present invention. For example, a seriesof negatively charged amino acids can be added to the amino terminus ofthe B chain, including for example a peptide of 1 to 12, 1 to 10, 1 to 8or 1 to 6 amino acids in length and comprising one or more negativelycharged amino acids including for example glutamic acid and asparticacid. In one embodiment the B chain amino terminal extension comprises 1to 6 charged amino acids. In one embodiment the single chain insulinanalog comprises a B chain amino terminal extension that comprises thesequence X₆₀X₆₁X₆₂X₆₃X₆₄X₆₅K (SEQ ID NO: 47), wherein X₆₀ is selectedfrom the group consisting of glycine, glutamic acid and aspartic acid,and X₆₁, X₆₂, X₆₃X₆₄ and X₆₅ are independently glutamic acid or asparticacid. In one embodiment the B chain amino terminal extension comprisesthe sequence GX₆₁X₆₂X₆₃X₆₄X₆₅K (SEQ ID NO: 48) or X₆₁X₆₂X₆₃X₆₄X₆₅RK (SEQID NO: 49), wherein X₆₁, X₆₂, X₆₃X₆₄ and X₆₅ are independently glutamicacid or aspartic acid. In one embodiment the B chain comprises thesequence GEEEEEKGPEHLCGAHLVDALYLVCGDX₄₂GFY (SEQ ID NO: 50), wherein X₄₂is selected from the group consisting of alanine lysine, ornithine andarginine. In accordance with one embodiment the single chain insulinanalogs disclosed comprise a C-terminal amide or ester in place of aC-terminal carboxylate on the A chain.

High potency single chain insulin analogs can also be prepared based onmodified IGF I and IGF II sequences, as described in Internationalapplication PCT/2009/068713, the disclosure of which is expresslyincorporated herein by reference. More particularly, analogs of IGF Iand IGF II that comprise a substitution of a tyrosine leucine dipeptidefor the native IGF amino acids at positions corresponding to B16 and B17of native insulin have a tenfold increase in potency at the insulinreceptor. Accordingly, the single chain insulin analogs disclosed hereinmay include an A chain of IGF I (SEQ ID NO: 5) or IGF II (SEQ ID NO: 7)and a B chain of IGF I (SEQ ID NO: 6) or IGF II (SEQ ID NO: 8) or the Bchain of native insulin (SEQ ID NO: 2). In addition, the single chaininsulin analogs disclosed herein may include a native insulin A chain,or analog thereof, and a B chain of IGF I (SEQ ID NO: 6) or IGF II (SEQID NO: 8), as well as analogs of said B chains. In one embodiment thesingle chain insulin analog comprises an IGF I (SEQ ID NO: 5) A chain,or analog or derivative thereof and a B chain of IGF I (SEQ ID NO: 6),IGF II (SEQ ID NO: 8) or native insulin (SEQ ID NO: 2), or analogs orderivatives thereof.

Additional modifications to the single chain IGF or insulin A and Bchains include, for example, modification of the amino acids at one ormore of positions A19, B16 or B25 (relative to the native insulin A andB chains) to a 4-amino phenylalanine or one or more amino acidsubstitutions at positions selected from A5, A8, A9, A10, A14, A15, A17,A18, A21, B1, B2, B3, B4, B5, B9, B10, B13, B14, B20, B21, B22, B23,B26, B27, B28, B29 and B30 (relative to the native A and B chains ofinsulin) or deletions of any or all of positions B1-4 and B26-30. In oneembodiment the substitutions at positions selected from A5, A8, A9, A10,A14, A15, A17, A18, A21, B1, B2, B3, B4, B5, B9, B10, B13, B14, B20,B21, B22, B23, B26, B27, B28, B29 and B30 are conservative amino acidsubstitutions relative to the native insulin sequence.

In accordance with one embodiment the B chain comprises the sequenceR₂₂—X₂₅LCGX₂₉X₃₀LVX₃₃X₃₄LYLVCGX₄₁X₄₂GFX₄₅ (SEQ ID NO: 21), and the Achain comprises the sequenceGIVX₄X₅CCX₈X₉X₁₀CX₁₂LX₁₄X₁₅LX₁₇X₁₈X₁₉CX₂₁—R₁₃ (SEQ ID NO: 22), wherein

X₄ is glutamic acid or aspartic acid;

X₅ is glutamine or glutamic acid

X₈ is histidine, threonine or phenylalanine;

X₉ is serine, arginine, lysine, ornithine or alanine;

X₁₀ is isoleucine or serine;

X₁₂ is serine or aspartic acid

X₁₄ is tyrosine, arginine, lysine, ornithine or alanine;

X₁₅ is glutamine, glutamic acid, arginine, alanine, lysine, ornithine orleucine;

X₁₇ is glutamic acid, aspartic acid, asparagine, lysine, ornithine orglutamine;

X₁₈ is methionine, asparagine, glutamine, aspartic acid, glutamic acidor threonine;

X₁₉ is tyrosine, 4-methoxy-phenylalanine or 4-amino phenylalanine;

X₂₁ is selected from the group consisting of alanine, glycine, serine,valine, threonine, isoleucine, leucine, glutamine, glutamic acid,asparagine, aspartic acid, histidine, tryptophan, tyrosine, andmethionine;

X₂₅ is histidine or threonine;

X₂₉ is selected from the group consisting of alanine, glycine andserine;

X₃₀ is selected from the group consisting of histidine, aspartic acid,glutamic acid, homocysteic acid and cysteic acid;

X₃₃ is selected from the group consisting of aspartic acid, glutamineand glutamic acid;

X₃₄ is selected from the group consisting of alanine and threonine;

X₄₁ is selected from the group consisting of glutamic acid, asparticacid or asparagine;

X₄₂ is selected from the group consisting of alanine, lysine, ornithineand arginine;

X₄₅ is tyrosine or phenylalanine;

R₂₂ is selected from the group consisting of AYRPSE (SEQ ID NO: 14),FVNQ (SEQ ID NO: 12), PGPE (SEQ ID NO: 11), a tripeptideglycine-proline-glutamic acid, a tripeptide valine-asparagine-glutamine,a dipeptide proline-glutamic acid, a dipeptide asparagine-glutamine,glutamine, glutamic acid and an N-terminal amine; and

R₁₃ is COOH or CONH₂. In one embodiment X₈, X₂₅ and X₃₀ are eachhistidine. In a further embodiment the single chain insulin analogpeptide comprises an analog of the A chain peptide sequence of SEQ IDNO: 19 and/or a B chain peptide sequence of SEQ ID NO: 20 wherein theanalog of the A chain and B chain each comprise 1-3 further amino acidsubstitutions.

In one embodiment a single chain insulin analog is provided thatcomprises the structure: IB-LM-IA, wherein IB comprises the sequenceR₂₂—X₂₅LCGX₂₉X₃₀LVX₃₃X₃₄LYLVCGX₄₁X₄₂GFX₄₅(SEQ ID NO: 21), LM is alinking moiety as disclosed herein that covalently links IB to IA, andIA comprises the sequence GIVX₄X₅CCX₈X₉X₁₀CX₁₂LX₁₄X₁₅LX₁₇X₁₈X₁₉CX₂₁—R₁₃(SEQ ID NO: 22), wherein

X₄ is glutamic acid or aspartic acid;

X₅ is glutamine or glutamic acid;

X₈ is histidine or phenylalanine;

X₉ and X₁₄ are independently selected from arginine, lysine, ornithineor alanine;

X₁₀ is isoleucine or serine;

X₁₂ is serine or aspartic acid;

X₁₄ is tyrosine, arginine, lysine, ornithine or alanine;

X₁₅ is arginine, lysine, ornithine or leucine;

X₁₇ is glutamic acid or glutamine;

X₁₈ is methionine, asparagine or threonine;

X₁₉ is tyrosine, 4-methoxy-phenylalanine or 4-amino phenylalanine;

X₂₁ is alanine, glycine or asparagine;

X₂₅ is selected from the group consisting of histidine and threonine;

X₂₉ is selected from the group consisting of alanine, glycine andserine;

X₃₀ is selected from the group consisting of histidine, aspartic acid,glutamic acid, homocysteic acid and cysteic acid;

X₃₃ is selected from the group consisting of aspartic acid and glutamicacid;

X₃₄ is selected from the group consisting of alanine and threonine;

X₄₁ is selected from the group consisting of glutamic acid, asparticacid or asparagine;

X₄₂ is selected from the group consisting of alanine, lysine, ornithineand arginine;

R₂₂ is selected from the group consisting of AYRPSE (SEQ ID NO: 14),FVNQ (SEQ ID NO: 12), PGPE (SEQ ID NO: 11), a tripeptideglycine-proline-glutamic acid, a tripeptide valine-asparagine-glutamine,a dipeptide proline-glutamic acid, a dipeptide asparagine-glutamine,glutamine, glutamic acid and an N-terminal amine; and

R₁₃ is COOH or CONH₂, further wherein the amino acid at the designationX₄₅ is directly bound to the linking moiety, LM (i.e., the designationIB-LM-IA as used herein is intended to represent that the B chaincarboxyl terminus and the amino terminus of the A chain are directlylinked to the linking moiety LM without any further intervening aminoacids).

In accordance with one embodiment the linking moiety LM is selected fromthe group consisting of (Y₁)_(k)—X₅₁AX₅₃X₅₄X₅₅X₅₆X₅₇R(Y₂)_(n) (SEQ IDNO: 28), (Y₁)_(k), (Y₁)_(k)-GX₅₂GSSSX₅₇R—(Y₂)_(n) (SEQ ID NO: 90),(Y₁)_(k)-GYGSSSX₅₇R(Y₂)_(n) (SEQ ID NO: 51) and

wherein

Y₁ is selected from the group X₄₆, X₄₆X₄₇, X₄₆X₄₇X₄₈, X₄₆X₄₇X₄₈X₄₉ (SEQID NO: 24) and X₄₆X₄₇X₄₈X₄₉X₅₀ (SEQ ID NO: 13); and

Y₂ is selected from the group X₇₀, X₇₀X₇₁, X₇₀X₇₁X₇₂ and X₇₀X₇₁X₇₂X₇₃(SEQ ID NO: 15);

n is 0 or 1;

k is 0 or 1;

m is an integer selected from 8 to 16;

X₄₆ through X₅₀ and X₇₀ through X₇₃ are each independently any aminoacid;

X₅₁ is selected from the group consisting of glycine, alanine, valine,leucine, isoleucine, proline, phenylalanine and methionine;

X₅₂ is any amino acid other than tyrosine; and

X₅₃, X₅₄, X₅₅ and X₅₆ are independently selected from the groupconsisting of glycine, alanine, serine, threonine and proline and

X₅₇ is arginine, lysine or ornithine. In accordance with one embodimentat least one of n or k is 1. In one embodiment Y₂ is selected from thegroup consisting of A, AP, APQ and APQT (SEQ ID NO: 82) and Y₁ isselected from the group consisting of F, Y, FN, YT, FNK, YTP, FNKP (SEQID NO: 77), FNPK (SEQ ID NO: 79), YTPK (SEQ ID NO: 78), YTPKT (SEQ IDNO: 16), YTKPT (SEQ ID NO: 80), FNKPT (SEQ ID NO: 76) and FNPKT (SEQ IDNO: 81). In one embodiment the linking moiety comprises a sequenceselected from X₅₁AX₅₃X₅₄X₅₅X₅₆X₅₇R(Y₂)_(n) (SEQ ID NO: 28),(Y₁)_(k)-GYGSSSX₅₇R(Y₂)_(n) (SEQ ID NO: 51) and(Y₁)_(k)-GX₅₂GSSSX₅₇R(Y₂)_(n) (SEQ ID NO: 90), wherein X₅₁ is selectedfrom the group consisting of glycine, alanine, valine, leucine,isoleucine, proline, phenylalanine and methionine, X₅₂ is selected fromthe group consisting of glycine, alanine, valine, leucine, isoleucine,serine, threonine and proline, X₅₃ is other than glycine, X₅₄, and X₅₅are independently selected from the group consisting of glycine,alanine, serine, threonine and proline, X₅₆ is serine and X₅₇ isarginine, lysine or ornithine. In one embodiment n is 1 and k is 0,alternatively in one embodiment k is 1 and n is 0 and in one embodimentboth n and k are 1. In one embodiment the linking moiety is polyethyleneglycol wherein m is an integer selected from 10 to 14.

In accordance with one embodiment an insulin analog is provided whereinthe A chain of the insulin peptide comprises the sequenceGIVEQCCX₈SICSLYQLX₁₇NX₁₉CX₂₃ (SEQ ID NO: 52) and the B chain comprisingthe sequence X₂₅LCGX₂₉X₃₀LVEALYLVCGERGFF (SEQ ID NO: 53) wherein

X₈ is selected from the group consisting of threonine and histidine;

X₁₇ is glutamic acid or glutamine;

X₁₉ is tyrosine, 4-methoxy-phenylalanine or 4-amino phenylalanine;

X₂₃ is asparagine or glycine;

X₂₅ is selected from the group consisting of histidine and threonine;

X₂₉ is selected from the group consisting of alanine, glycine andserine;

X₃₀ is selected from the group consisting of histidine, aspartic acid,glutamic acid, homocysteic acid and cysteic acid. In a furtherembodiment the B chain comprises the sequenceX₂₂VNQX₂₅LCGX₂₉X₃₀LVEALYLVCGERGFFYT-Z₁-B₁ (SEQ ID NO: 54) wherein

X₂₂ is selected from the group consisting of phenylalanine anddesamino-phenylalanine;

X₂₅ is selected from the group consisting of histidine and threonine;

X₂₉ is selected from the group consisting of alanine, glycine andserine;

X₃₀ is selected from the group consisting of histidine, aspartic acid,glutamic acid, homocysteic acid and cysteic acid;

Z₁ is a dipeptide selected from the group consisting ofaspartate-lysine, lysine-proline, and proline-lysine; and

B₁ is selected from the group consisting of threonine, alanine or athreonine-arginine-arginine tripeptide.

In accordance with one embodiment a single chain insulin analog isprovided that comprises the structure: IB-LM-IA, wherein IB comprisesthe sequence X₂₅LCGX₂₉X₃₀LVEALYLVCG ERGFF (SEQ ID NO: 53), LM is alinking moiety as disclosed herein that covalently links IB to IA, andIA comprises the sequence GIVEQCCX₈SICSLYQLENX₁₉CX₂₁ (SEQ ID NO: 55),wherein the C-terminal phenylalanine residue of SEQ ID NO: 54 isdirectly covalently bound to the linking moiety, LM, in the absence ofany intervening amino acids. In accordance with one embodiment linkingmoiety LM is selected from the group consisting of(Y₁)_(k)—X₅₁AX₅₃X₅₄X₅₅X₅₆X₅₇R(Y₂)_(n) (SEQ ID NO: 28),(Y₁)_(k)-GYGSSSX₅₇R(Y₂)_(n) (SEQ ID NO: 51) and

wherein

Y₁ is selected from the group X₄₆, X₄₆X₄₇, X₄₆X₄₇X₄₈, X₄₆X₄₇X₄₈X₄₉ (SEQID NO: 24) and X₄₆X₄₇X₄₈X₄₉X₅₀ (SEQ ID NO: 13); and

Y₂ is selected from the group X₇₀, X₇₀X₇₁, X₇₀X₇₁X₇₂ and X₇₀X₇₁X₇₂X₇₃(SEQ ID NO: 15);

n is 0 or 1;

k is 0 or 1;

m is an integer ranging from 7 to 16;

X₄₆ through X₅₀ and X₇₀ through X₇₃ are each independently any aminoacid; and

X₅₇ is arginine, lysine or ornithine.

In accordance with one embodiment the single chain insulin analogcomprises a B chain having the sequence R₂₂—HLCGSX₃₀LVEALYLVCGERGFF (SEQID NO: 154) or R₂₄—X₂₅LCGX₂₉X₃₀LVX₃₃X₃₄LYLVCGX₄₁X₄₂GFX₄₅ (SEQ ID NO: 21)and an A having the sequence GIVEQCCX₈SICSLYQLENX₁₉CX₂₁—R₁₃ (SEQ ID NO:55) or GIVX₄ECCX₈X₉SCDLX₁₄X₁₅LX₁₇X₁₈X₁₉CX₂₁—R₁₃ (SEQ ID NO: 19)

wherein

X₄ is glutamic acid or aspartic acid;

X₈ is histidine, threonine or phenylalanine;

X₉ is arginine, lysine, ornithine or alanine;

X₁₄ is arginine, lysine, ornithine or alanine;

X₁₅ is glutamine, glutamic acid, arginine, alanine, lysine, ornithine orleucine;

X₁₇ is glutamine or glutamic acid

X₁₈ is methionine, asparagine, glutamine, aspartic acid, glutamic acidor threonine;

X₁₉ is tyrosine, 4-methoxy-phenylalanine or 4-amino phenylalanine;

X₂₁ is selected from the group consisting of alanine, glycine, serine,valine, threonine, isoleucine, leucine, glutamine, glutamic acid,asparagine, aspartic acid, histidine, tryptophan, tyrosine, andmethionine;

X₂₂ is selected from the group consisting of phenylalanine anddesamino-phenylalanine;

X₂₃ is asparagine or glycine;

X₂₅ is histidine or threonine;

X₂₉ is selected from the group consisting of alanine and glycine;

X₃₀ is selected from the group consisting of histidine, aspartic acid,glutamic acid, homocysteic acid and cysteic acid;

X₃₃ is selected from the group consisting of aspartic acid, glutamineand glutamic acid;

X₃₄ is selected from the group consisting of alanine and threonine;

X₄₁ is selected from the group consisting of glutamic acid, asparticacid or asparagine;

X₄₂ is selected from the group consisting of alanine, lysine, ornithineand arginine;

X₄₅ is tyrosine or phenylalanine;

R₂₂ is selected from the group consisting of X₂₂VNQ (SEQ ID NO: 84), atripeptide valine-asparagine-glutamine, a dipeptideasparagine-glutamine, glutamine, and a bond;

R₂₄ is selected from the group consisting of AYRPSE (SEQ ID NO: 14),PGPE (SEQ ID NO: 11), a tripeptide glycine-proline-glutamic acid, adipeptide proline-glutamic acid, glutamine, glutamic acid and a bond;and

R₁₃ is COOH or CONH₂.

In accordance with some embodiments the single chain insulin analogcomprises a B chain having the sequenceR₂₃—R₂₄—X₂₅LCGX₂₉X₃₀LVX₃₃X₃₄LYLVCGX₄₁X₄₂GFX₄₅ (SEQ ID NO: 21) orR₂₃—R₂₂—HLCGSX₃₀LVEALYLVCGERGFF (SEQ ID NO: 154) and an A chain havingthe sequence GIVX₄ECCX₈X₉SCDLX₁₄X₁₅LX₁₇X₁₈X₁₉CX₂₁—R₁₃ (SEQ ID NO: 19)

wherein

X₄ is glutamic acid or aspartic acid;

X₈ is histidine, threonine or phenylalanine;

X₉ is arginine, lysine, ornithine or alanine;

X₁₄ is arginine, lysine, ornithine or alanine;

X₁₅ is glutamine, glutamic acid, arginine, alanine, lysine, ornithine orleucine;

X₁₇ is glutamine or glutamic acid;

X₁₈ is methionine, asparagine, glutamine, aspartic acid, glutamic acidor threonine;

X₁₉ is tyrosine, 4-methoxy-phenylalanine or 4-amino phenylalanine;

X₂₁ is selected from the group consisting of alanine, glycine, serine,valine, threonine, isoleucine, leucine, glutamine, glutamic acid,asparagine, aspartic acid, histidine, tryptophan, tyrosine, andmethionine;

X₂₂ is selected from the group consisting of phenylalanine anddesamino-phenylalanine;

X₂₅ is histidine or threonine;

X₂₉ is selected from the group consisting of alanine and glycine;

X₃₀ is selected from the group consisting of histidine, aspartic acid,glutamic acid, homocysteic acid and cysteic acid;

X₃₃ is selected from the group consisting of aspartic acid, glutamineand glutamic acid;

X₃₄ is selected from the group consisting of alanine and threonine;

X₄₁ is selected from the group consisting of glutamic acid, asparticacid or asparagine;

X₄₂ is selected from the group consisting of alanine, lysine, ornithineand arginine;

X₄₅ is tyrosine or phenylalanine;

R₂₂ is selected from the group consisting of X₂₂VNQ (SEQ ID NO: 84), atripeptide valine-asparagine-glutamine, a dipeptideasparagine-glutamine, glutamine, and a bond;

R₂₃ is an N-terminal amine or X₆₀(X₆₁X₆₂)_(d)X₆₃K (SEQ ID NO: 192)

-   -   wherein    -   X₆₀ is selected from the group consisting of glycine, glutamic        acid and aspartic acid;    -   X₆₁ and X₆₂ are independently selected from the group consisting        of glutamic acid and aspartic acid;    -   X₆₃ is selected from the group consisting of arginine aspartic        acid and glutamic acid;    -   d is an integer ranging from 1-3;

R₂₄ is selected from the group consisting of AYRPSE (SEQ ID NO: 14),PGPE (SEQ ID NO: 11), a tripeptide glycine-proline-glutamic acid, adipeptide proline-glutamic acid, glutamine, glutamic acid and a bond;and

R₁₃ is COOH or CONH₂.

In accordance with some embodiments the A chain comprises the sequenceGIVEQCCX₈SICSLYQLX₁₇NX₁₉CX₂₃ (SEQ ID NO: 52) orGIVDECCX₈X₉SCDLX₁₄X₁₅LX₁₇X₁₈X₁₉CX₂₁—R₁₃ (SEQ ID NO: 56), and the B chaincomprises the sequence X₂₅LCGX₂₉X₃₀LVX₃₃X₃₄LYLVCGDX₄₂GFX₄₅ (SEQ ID NO:58) wherein

X₈ is histidine or phenylalanine;

X₉ and X₁₄ are independently selected from arginine, lysine, ornithineor alanine;

X₁₅ is arginine, lysine, ornithine or leucine;

X₁₇ is glutamic acid or glutamine;

X₁₈ is methionine, asparagine or threonine;

X₁₉ is tyrosine, 4-methoxy-phenylalanine or 4-amino phenylalanine;

X₂₁ is alanine, glycine or asparagine;

X₂₃ is asparagine or glycine;

X₂₅ is selected from the group consisting of histidine and threonine;

X₂₉ is selected from the group consisting of alanine, glycine andserine;

X₃₀ is selected from the group consisting of histidine, aspartic acid,glutamic acid, homocysteic acid and cysteic acid;

X₃₃ is selected from the group consisting of aspartic acid and glutamicacid;

X₃₄ is selected from the group consisting of alanine and threonine;

X₄₂ is selected from the group consisting of alanine, lysine, ornithineand arginine;

X₄₅ is tyrosine; and

R₁₃ is COOH or CONH₂. In one embodiment at least one of n and k is 1.

In a further embodiment the A chain comprises the sequenceGIVDECCHX₉SCDLX₁₄X₁₅LX₁₇X₁₈X₁₉CX₂₁—R₁₃ (SEQ ID NO: 56), and the B chaincomprises the sequence X₂₅LCGX₂₉X₃₀LVX₃₃X₃₄LYLVCGDX₄₂GFX₄₅ (SEQ ID NO:58) wherein

X₉ and X₁₄ are independently selected from arginine, lysine, ornithineor alanine;

X₁₅ is arginine, lysine, ornithine or leucine;

X₁₇ is glutamic acid, aspartic acid, asparagine, lysine, ornithine orglutamine;

X₁₈ is methionine, asparagine or threonine;

X₁₉ is tyrosine, 4-methoxy-phenylalanine or 4-amino phenylalanine;

X₂₁ is alanine, glycine or asparagine;

X₂₃ is asparagine or glycine;

X₂₅ is selected from the group consisting of histidine and threonine;

X₂₉ is selected from the group consisting of alanine, glycine andserine;

X₃₀ is selected from the group consisting of histidine, aspartic acid,glutamic acid, homocysteic acid and cysteic acid;

X₃₃ is selected from the group consisting of aspartic acid and glutamicacid;

X₃₄ is selected from the group consisting of alanine and threonine;

X₄₂ is selected from the group consisting of alanine, lysine, ornithineand arginine;

X₄₅ is tyrosine or phenylalanine and

R₁₃ is COOH or CONH₂. In a further embodiment the A chain comprises thesequence GIVDECCHX₉SCDLX₁₄X₁₅LX₁₇MX₁₉CX₂₁—R₁₃ (SEQ ID NO: 59), and the Bchain comprises the sequence X₂₅LCGAX₃₀LVDALYLVCGDX₄₂GFX₄₅ (SEQ ID NO:60) wherein X₉, X₁₄ and X₁₅ are independently ornithine, lysine orarginine;

X₁₇ is glutamic acid or glutamine;

X₁₉ is tyrosine, 4-methoxy-phenylalanine or 4-amino phenylalanine;

X₂₁ is alanine, glycine or asparagine;

X₂₅ is selected from the group consisting of histidine and threonine;

X₃₀ is selected from the group consisting of histidine, aspartic acidand glutamic acid;

X₄₂ is selected from the group consisting of alanine, lysine, ornithineand arginine;

X₄₅ is tyrosine or phenylalanine and

R₁₃ is COOH or CONH₂. In one embodiment the B chain is selected from thegroup consisting of HLCGAELVDALYLVCGDX₄₂GFY (SEQ ID NO: 61),GPEHLCGAELVDALYLVCGDX₄₂GFY (SEQ ID NO: 62),GPEHLCGAELVDALYLVCGDX₄₂GFYFNPKT (SEQ ID NO: 63) andGPEHLCGAELVDALYLVCGDX₄₂GFYFNKPT (SEQ ID NO: 64), wherein X₄₂ is selectedfrom the group consisting of ornithine, lysine and arginine. In afurther embodiment the A chain comprises the sequenceGIVDECCHX₉SCDLX₁₄X₁₅LQMYCN—R₁₃ (SEQ ID NO: 66), wherein X₉, X₁₄ and X₁₅are independently ornithine, lysine or arginine.

In accordance with one embodiment the linking moiety is a peptide isselected from the group consisting of(Y₁)_(k)—X₅₁AX₅₃X₅₄X₅₅X₅₆X₅₇R(Y₂)_(n) (SEQ ID NO: 28) and(Y₁)_(k)-GYGSSSX₅₇R (SEQ ID NO: 51), wherein

Y₁ is selected from the group X₄₆, X₄₆X₄₇, X₄₆X₄₇X₄₈, X₄₆X₄₇X₄₈X₄₉ (SEQID NO: 24) and X₄₆X₄₇X₄₈X₄₉X₅₀ (SEQ ID NO: 13); and

Y₂ is selected from the group X₇₀, X₇₀X₇₁, X₇₀X₇₁X₇₂ and X₇₀X₇₁X₇₂X₇₃(SEQ ID NO: 15), n is 0 or 1 and k is 0 or 1, wherein at least one of nand k is 1. In one embodiment n is 1 and Y₁ is selected from the groupconsisting of F, Y, FN, YT, FNK, YTP, FNPK (SEQ ID NO: 79), FNKP (SEQ IDNO: 77), YTPK (SEQ ID NO: 78), YTPKT (SEQ ID NO: 16), YTKPT (SEQ ID NO:80), FNKPT (SEQ ID NO: 76) and FNPKT (SEQ ID NO: 81). In anotherembodiment Y₁ is selected from the group consisting of F, FN, FNK, FNPK(SEQ ID NO: 79), FNKPT (SEQ ID NO: 76) and FNPKT (SEQ ID NO: 81). In oneembodiment Y₂ is selected from the group consisting of A, AP, APQ andAPQT (SEQ ID NO: 82). In a further embodiment both n and k are 1 and Y₁is selected from the group consisting of F, Y, FN, YT, FNK, YTP, FNPK(SEQ ID NO: 79), FNKP (SEQ ID NO: 77), YTPK (SEQ ID NO: 78), YTPKT (SEQID NO: 16), YTKPT (SEQ ID NO: 80), FNKPT (SEQ ID NO: 76) and FNPKT (SEQID NO: 81) and Y₂ is selected from the group consisting of A, AP, APQand APQT (SEQ ID NO: 82).

In one embodiment a single chain insulin analog is provided comprisingthe general formula IB-LM-IA wherein IB is an amino acid sequenceselected from the group consisting of HLCGAELVDALYLVCGDX₄₂GFY (SEQ IDNO: 61), GPEHLCGAELVDALYLVCGDX₄₂GFY (SEQ ID NO: 62),GPEHLCGAELVDALYLVCGDX₄₂GFYFNPKT (SEQ ID NO: 63) andGPEHLCGAELVDALYLVCGDX₄₂GFYFNKPT (SEQ ID NO: 64), LM is a linking moietyselected from the group consisting of GAGSSSX₅₇RAPQT SEQ ID NO: 66),GYGSSSX₅₇R (SEQ ID NO: 51) and

and IA is the amino acid sequence GIVDECCHX₉SCDLX₁₄X₁₅LQMYCN—R₁₃ (SEQ IDNO: 66), wherein X₉, X₁₄, X₁₅X₄₂ and X₅₇ are independently ornithine,lysine or arginine, and m is an integer selected from the range of 10 to12. In one further embodiment the linking moiety is GYGSSSOR (SEQ ID NO:65).

In accordance with one embodiment a single chain insulin analog isprovided that comprises the structure: IB-LM-IA, wherein IB comprisesthe sequence X₂₅LCGX₂₉X₃₀LVX₃₃X₃₄LYLVCGDX₄₂GFX₄₅ (SEQ ID NO: 58), LM isa linking moiety as disclosed herein that covalently links IB to IA, andIA comprises the sequence GIVEQCCHSICSLYQLENX₁₉CX₂₁—R₁₃ (SEQ ID NO: 68)or GIVDECCX₈X₉SCDLX₁₄X₁₅LX₁₇X₁₈X₁₉CX₂₁—R₁₃ (SEQ ID NO: 69), wherein theC-terminal phenylalanine residue of SEQ ID NO: 69 is directly covalentlybound to the linking moiety, LM, in the absence of any intervening aminoacids. In accordance with one embodiment a single chain insulin analogis provided that comprises the sequenceX₂₅LCGX₂₉X₃₀LVX₃₃X₃₄LYLVCGDX₄₂GFX₄₅ (SEQ ID NO:58)-(Y₁)_(k)—X₅₁AX₅₃X₅₄X₅₅X₅₆X₅₇X₅₈(Y₂)_(n) (SEQ ID NO:70)-GIVDECCX₈X₉SCDLX₁₄X₁₅LX₁₇X₁₈X₁₉CX₂₁—R₁₃ (SEQ ID NO: 56) orX₂₅LCGX₂₉X₃₀LVX₃₃X₃₄LYLVCGDX₄₂GFX₄₅(Y₁)_(k) (SEQ ID NO:58)-X₅₁AX₅₃X₅₄X₅₅X₅₆X₅₇X₅₈(Y₂)_(n) (SEQ ID NO:70)-GIVEQCCHSICSLYQLENX₁₉CX₂₁—R₁₃ (SEQ ID NO: 68), wherein X₄₆ throughX₅₆ and X₇₀X₇₁X₇₂X₇₃ (SEQ ID NO: 15) are independently any amino acid,X₅₇ and X₅₈ are independently arginine, ornithine or lysine, Y₁ isselected from the group X₄₆, X₄₆X₄₇, X₄₆X₄₇X₄₈, X₄₆X₄₇X₄₈X₄₉ (SEQ ID NO:24) and X₄₆X₄₇X₄₈X₄₉X₅₀ (SEQ ID NO: 13),Y₂ is selected from the groupX₇₀, X₇₀X₇₁, X₇₀X₇₁X₇₂ and X₇₀X₇₁X₇₂X₇₃ (SEQ ID NO: 15) wherein n and kare independently 0 or 1;

X₈ is histidine, threonine or phenylalanine;

X₉ is serine, arginine, lysine, ornithine or alanine;

X₁₄ is tyrosine, arginine, lysine, ornithine or alanine;

X₁₅ is glutamine, glutamic acid, arginine, alanine, lysine, ornithine orleucine;

X₁₇ is glutamic acid, aspartic acid, asparagine, lysine, ornithine orglutamine;

X₁₈ is methionine, asparagine, glutamine, aspartic acid, glutamic acidor threonine;

X₁₉ is tyrosine, 4-methoxy phenylalanine or 4-amino-phenylalanine;

X₂₁ is alanine, glycine or asparagine;

X₂₅ is histidine or threonine;

X₂₉ is selected from the group consisting of alanine, glycine andserine;

X₃₀ is selected from the group consisting of histidine, aspartic acid,glutamic acid, homocysteic acid and cysteic acid;

X₃₃ is selected from the group consisting of aspartic acid, glutamineand glutamic acid;

X₃₄ is selected from the group consisting of alanine and threonine;

X₄₂ is selected from the group consisting of alanine, lysine, ornithineand arginine; and

X₄₅ is tyrosine or phenylalanine;

In accordance with one embodiment a single chain insulin analog isprovided that comprises the structure: IB-LM-IA, wherein IB comprisesthe sequence X₂₅LCGX₂₉X₃₀LVX₃₃X₃₄LYLVCGDX₄₂GFX₄₅ (SEQ ID NO: 58), LM isa linking moiety selected from the group consisting of(Y₁)_(k)—X₅₁AX₅₃X₅₄X₅₅X₅₆RR(Y₂)_(n) (SEQ ID NO: 23),(Y₂)_(k)-GYGSSSX₅₇R(Y₂)_(n) (SEQ ID NO: 51) and

wherein

Y₁ is selected from the group X₄₆, X₄₆X₄₇, X₄₆X₄₇X₄₈, X₄₆X₄₇X₄₈X₄₉ (SEQID NO: 24) and X₄₆X₄₇X₄₈X₄₉X₅₀ (SEQ ID NO: 13); and

Y₂ is selected from the group X₇₀, X₇₀X₇₁, X₇₀X₇₁X₇₂ and X₇₀X₇₁X₇₂X₇₃(SEQ ID NO: 15);

n is 0 or 1;

k is 0 or 1;

m is an integer ranging from 7 to 16; and

X₄₆ through X₅₀ and X₇₀ through X₇₃ are each independently any aminoacid;

X₅₁ is selected from the group consisting of glycine, alanine, valine,leucine, isoleucine, proline, phenylalanine and methionine;

X₅₂ is any amino acid other than tyrosine;

X₅₃ through X₅₆ are each independently any amino acid; and

X₅₇ is arginine, lysine or ornithine; and the A chain comprises thesequence GIVDECCHX₉SCDLX₁₄X₁₅LQMYCN—R₁₃ (SEQ ID NO: 66), wherein X₉, X₁₄and X₁₅ are independently ornithine, lysine or arginine and R₁₃ is COOHor CONH₂.

In one embodiment the B chain is selected from the group consisting ofHLCGAELVDALYLVCGDOGFY (SEQ ID NO: 71), GPEHLCGAELVDALYLVCGDOGFY (SEQ IDNO: 72), GPEHLCGAELVDALYLVCGDOGFYFNPKT (SEQ ID NO: 73) andGPEHLCGAELVDALYLVCGDOGFYFNKPT (SEQ ID NO: 74) and the A chain isGIVDECCHOSCDLOOLQMX₁₉CN—R₁₃ (SEQ ID NO: 75), wherein X₁₉ is tyrosine,4-methoxy-phenylalanine or 4-amino phenylalanine. In one embodiment atleast one of n and k is 1.

In one embodiment a single chain insulin analog is provided comprisingthe sequenceX₂₅LCGX₂₉X₃₀LVX₃₃X₃₄LYLVCGX₄₁X₄₂GFX₄₅X₅₁X₅₂X₅₃X₅₄X₅₅X₅₆X₅₇X₅₈X₇₀X₇₁X₇₂X₇₃GIVX₄X₅CCX₈X₉X₁₀CX₁₂LX₁₄X₁₅LEX₁₈X₁₉CX₂₁—R₁₃(SEQ ID NO: 168);X₂₅LCGX₂₉X₃₀LVX₃₃X₃₄LYLVCGX₄₁X₄₂GFX₄₅X₅₁X₅₂GSSSX₅₇X₅₈APQTGIVX₄X₅CCX₈X₉X₁₀CX₁₂LX₁₄X₁₅LEX₁₈X₁₉CX₂₁—R₁₃(SEQ ID NO: 169);X₂₅LCGX₂₉X₃₀LVX₃₃X₃₄LYLVCGX₄₁X₄₂GFX₄₅X₄₆X₄₇X₄₈TX₅₁X₅₂GSSSX₅₇X₅₈APQTGIVX₄X₅CCX₈X₉X₁₀CX₁₂LX₁₄X₁₅LEX₁₈X₁₉CX₂₁—R₁₃(SEQ ID NO: 170;X₂₅LCGX₂₉X₃₀LVX₃₃X₃₄LYLVCGX₄₁X₄₂GFX₄₅X₄₆X₄₇X₄₈TX₅₁X₅₂GSSSX₅₇X₅₈GIVX₄X₅CCX₈X₉X₁₀CX₁₂LX₁₄X₁₅LEX₁₈X₁₉CX₂₁—R₁₃(SEQ ID NO: 171); orX₂₅LCGX₂₉X₃₀LVX₃₃X₃₄LYLVCGX₄₁X₄₂GFX₄₅GAGSSSRX₅₈APQTGIVX₄X₅CCX₈X₉X₁₀CX₁₂LX₁₄X₁₅LEX₁₈X₁₉CX₂₁—R₁₃(SEQ ID NO: 172);

wherein

X₄ is glutamic acid or aspartic acid;

X₅ is glutamic acid or glutamine

X₈ is threonine, histidine or phenylalanine;

X₉ is serine, arginine, ornithine or alanine;

X₁₀ is serine or isoleucine;

X₁₂ is serine or aspartic acid;

X₁₄ is arginine, tyrosine, ornithine or alanine;

X₁₅ is glutamine, arginine, alanine, ornithine or leucine;

X₁₈ is methionine, asparagine or threonine;

X₁₉ is tyrosine, 4-methoxy-phenylalanine or 4-amino phenylalanine;

X₂₁ is alanine, glycine or asparagine;

X₂₅ is selected from the group consisting of histidine and threonine;

X₂₉ is selected from the group consisting of alanine, glycine andserine;

X₃₀ is selected from the group consisting of histidine, aspartic acid,glutamic acid, homocysteic acid and cysteic acid;

X₃₃ is selected from the group consisting of aspartic acid and glutamicacid;

X₃₄ is selected from the group consisting of alanine and threonine;

X₄₁ is selected from the group consisting of aspartic acid and glutamicacid;

X₄₂ is selected from the group consisting of alanine, ornithine andarginine;

X₄₅ is selected from the group consisting of tyrosine and phenylalanine;

X₄₆ is phenylalanine or tyrosine;

X₄₇ is asparagine or threonine;

X₄₈ is an aspartate-lysine dipeptide, an arginine-proline dipeptide, alysine-proline dipeptide, or a proline-lysine dipeptide;

X₅₁ is selected from the group consisting of glycine, alanine, valine,leucine, isoleucine, proline, phenylalanine and methionine;

X₅₂ is any amino acid other than tyrosine;

X₅₃ through X₅₆ are each independently any amino acid; and

X₅₇ and X₅₈ are independently arginine, lysine, cysteine, homocysteine,acetyl-phenylalanine or ornithine;

X₇₀-X₇₃ are independently any amino acid and R₁₃ is COOH or CONH₂. Inone embodiment the single chain insulin further comprises a polyethyleneglycol chain covalently linked to the side chain of an amino acid of thelinking moiety and/or at a position selected from the group consistingof the N-terminal alpha amine of the B chain, the side chain of an aminoacid at position A9, A14 and A15 of the A chain or positions B1, B2,B10, B22, B28 or B29 of the B chain. In one embodiment the polyethyleneglycol chain is covalently linked to the side chain of an amino acid ofthe linking moiety and/or at the N-terminal alpha amine of the B chain(e.g. at position B1 for insulin based B chain or position B2 for IGF-1based B chain). In one embodiment the polyethylene glycol chain iscovalently linked to the side chain of an amino acid at position 8 ofthe linking moiety.

Pegylation of Single Chain Insulin Analogs

Applicants have surprisingly discovered that covalently linkage of ahydrophilic moiety to the insulin single chain analogs disclosed hereinprovide analogs having slower onset, extended duration and exhibit abasal profile of activity. In one embodiment, the single chain insulinanalogs disclosed herein are further modified to comprise a hydrophilicmoiety covalently linked to the side chain of an amino acid at aposition selected from the group consisting of A9, A14 and A15 of the Achain or at the N-terminal alpha amine of the B chain (e.g. at positionB1 for insulin based B chain or position B2 for IGF-1 based B chain) orat the side chain of an amino acid at position B1, B2, B10, B22, B28 orB29 of the B chain or at any position of the linking moiety that linksthe A chain and B chain. In exemplary embodiments, this hydrophilicmoiety is covalently linked to a Lys, Cys, Orn, homocysteine, oracetyl-phenylalanine residue at any of these positions. In oneembodiment the hydrophilic moiety is covalently linked to the side chainof an amino acid of the linking moiety.

Exemplary hydrophilic moieties include polyethylene glycol (PEG), forexample, of a molecular weight of about 1,000 Daltons to about 40,000Daltons, or about 20,000 Daltons to about 40,000 Daltons. Additionalsuitable hydrophilic moieties include, polypropylene glycol,polyoxyethylated polyols (e.g., POG), polyoxyethylated sorbitol,polyoxyethylated glucose, polyoxyethylated glycerol (POG),polyoxyalkylenes, polyethylene glycol propionaldehyde, copolymers ofethylene glycol/propylene glycol, monomethoxy-polyethylene glycol,mono-(C1-C10) alkoxy- or aryloxy-polyethylene glycol,carboxymethylcellulose, polyacetals, polyvinyl alcohol (PVA), polyvinylpyrrolidone, poly-1,3-dioxolane, poly-1,3,6-trioxane, ethylene/maleicanhydride copolymer, poly (beta-amino acids) (either homopolymers orrandom copolymers), poly(n-vinyl pyrrolidone)polyethylene glycol,propropylene glycol homopolymers (PPG) and other polyakylene oxides,polypropylene oxide/ethylene oxide copolymers, colonic acids or otherpolysaccharide polymers, Ficoll or dextran and mixtures thereof.

The hydrophilic moiety, e.g., polyethylene glycol chain in accordancewith some embodiments has a molecular weight selected from the range ofabout 500 to about 40,000 Daltons. In one embodiment the hydrophilicmoiety, e.g. PEG, has a molecular weight selected from the range ofabout 500 to about 5,000 Daltons, or about 1,000 to about 5,000 Daltons.In another embodiment the hydrophilic moiety, e.g., PEG, has a molecularweight of about 10,000 to about 20,000 Daltons. In yet other exemplaryembodiment the hydrophilic moiety, e.g., PEG, has a molecular weight ofabout 20,000 to about 40,000 Daltons. In one embodiment the hydrophilicmoiety, e.g. PEG, has a molecular weight of about 20,000 Daltons. In oneembodiment a single chain insulin analog is provided wherein one or moreamino acids of the analog are pegylated, and the combined molecularweight of the covalently linked PEG chains is about 20,000 Daltons.

In one embodiment dextrans are used as the hydrophilic moiety. Dextransare polysaccharide polymers of glucose subunits, predominantly linked byα1-6 linkages. Dextran is available in many molecular weight ranges,e.g., about 1 kD to about 100 kD, or from about 5, 10, 15 or 20 kD toabout 20, 30, 40, 50, 60, 70, 80 or 90 kD.

Linear or branched polymers are contemplated. Resulting preparations ofconjugates may be essentially monodisperse or polydisperse, and may haveabout 0.5, 0.7, 1, 1.2, 1.5 or 2 polymer moieties per peptide.

In one embodiment the hydrophilic moiety is a polyethylene glycol (PEG)chain, optionally linked to the side chain of an amino acid at aposition selected from the group consisting of A9, A14 and A15 of the Achain, positions B1, B2, B10, B22, B28 or B29 of the B chain, at theN-terminal alpha amine of the B chain, or at any position of the linkingmoiety that links the A chain and B chain, including for example atposition C8. In one embodiment the single chain insulin analog comprisesa peptide linking moiety of 8 to 12 amino acids, wherein one of theamino acids of the linking moiety has a polyethylene chain covalentlybound to its side chain. In one embodiment the single chain insulinanalog comprises a peptide linking moiety of 8 to 12 amino acids,wherein an amino acid of the linking moiety is pegylated and one or moreamino acid at a position selected from the group consisting of A9, A14and A15 of the A chain, positions B1, B2, B10, B22, B28 or B29 of the Bchain is also pegylated. In one embodiment the total molecular weight ofthe covalently linked PEG chain(s) is about 20,000 Daltons.

In one embodiment a single chain insulin analog comprises a linkingmoiety of 8 to 12 amino acids, wherein one of the amino acids of thelinking moiety has a 20,000 Dalton polyethylene chain covalently boundto its side chain. In another embodiment a insulin analog comprises apeptide linking moiety of 8 to 12 amino acids, wherein one of the aminoacids of the linking moiety has a polyethylene chain covalently bound toits side chain and a second PEG chain is linked to the N-terminal alphaamine of the B chain (e.g. at position B1 for insulin based B chain orposition B2 for IGF-1 based B chain) or at the side chain of an aminoacid at position B1, B2 and B29 of the B chain. In one embodiment whentwo PEG chains are linked to the single chain insulin analog, each PEGchain has a molecular weight of about 10,000 Daltons. In one embodimentwhen the PEG chain is linked to an 8 to 12 amino acid linking moiety,the PEG chain is linked at position C7 or C8 of the linking moiety andin one embodiment the PEG chain is linked at position C8 of the linkingmoiety. In one embodiment when two PEG chains are linked to the singlechain insulin analog, with one PEG chain linked at position C8 and thesecond PEG is linked at A9, A14, A15, B1, B2, B10, B22, B28 or B29.

Hydrophilic moieties such as polyethylene glycol can be attached to thesingle chain insulin analog under any suitable conditions used to reacta protein with an activated polymer molecule. Any means known in the artcan be used, including via acylation, reductive alkylation, Michaeladdition, thiol alkylation or other chemoselective conjugation/ligationmethods through a reactive group on the PEG moiety (e.g., an aldehyde,amino, ester, thiol, α-haloacetyl, maleimido or hydrazino group) to areactive group on the target compound (e.g., an aldehyde, amino, ester,thiol, α-haloacetyl, maleimido or hydrazino group). Activating groupswhich can be used to link the water soluble polymer to one or moreproteins include without limitation sulfone, maleimide, sulfhydryl,thiol, triflate, tresylate, azidirine, oxirane and 5-pyridyl. Ifattached to the peptide by reductive alkylation, the polymer selectedshould have a single reactive aldehyde so that the degree ofpolymerization is controlled. See, for example, Kinstler et al., Adv.Drug. Delivery Rev. 54: 477-485 (2002); Roberts et al., Adv. DrugDelivery Rev. 54: 459-476 (2002); and Zalipsky et al., Adv. DrugDelivery Rev. 16: 157-182 (1995).

In a specific aspect of the invention, an amino acid residue on thesingle chain analog having a thiol is modified with a hydrophilic moietysuch as PEG. In some embodiments, the thiol is modified withmaleimide-activated PEG in a Michael addition reaction to result in aPEGylated peptide comprising the thioether linkage shown below:

In some embodiments, the thiol is modified with a haloacetyl-activatedPEG in a nucleophilic substitution reaction to result in a PEGylatedpeptide comprising the thioether linkage shown below:

Acylation of Single Chain Insulin Analogs

In some embodiments, the single chain insulin analog is modified tocomprise an acyl group. The acyl group can be covalently linked directlyto an amino acid of the single chain insulin analog, or indirectly to anamino acid of the single chain insulin analog via a spacer, wherein thespacer is positioned between the amino acid of the single chain insulinanalog and the acyl group. The single chain insulin analog may beacylated at the same amino acid position where a hydrophilic moiety islinked, or at a different amino acid position. For example, acylationmay occur at any position including any of amino acid of the A or Bchains as well as a position within the linking moiety, provided thatthe activity exhibited by the non-acylated single chain insulin analogis retained upon acylation. Nonlimiting examples include acylation atpositions A14 and A15 of the A chain, positions position B1 for insulinbased B chain or position B2 for IGF-1 based B chain or positions B10,B22, B28 or B29 of the B chain or at any position of the linking moiety.

In one specific aspect of the invention, the single chain insulin analog(or derivative or conjugate thereof) is modified to comprise an acylgroup by direct acylation of an amine, hydroxyl, or thiol of a sidechain of an amino acid of the single chain insulin analog. In someembodiments, the single chain insulin analog is directly acylatedthrough the side chain amine, hydroxyl, or thiol of an amino acid. Insome embodiments, acylation is at position B28 or B29 (according to theamino acid numbering of the native insulin A and B chain sequences). Inthis regard, a single chain insulin analog can be provided that has beenmodified by one or more amino acid substitutions in the A or B chainsequence, including for example at positions A14, A15, B1, B2, B10, B22,B28 or B29 (according to the amino acid numbering of the native insulinA and B chain sequences) or at any position of the linking moiety withan amino acid comprising a side chain amine, hydroxyl, or thiol. In somespecific embodiments of the invention, the direct acylation of thesingle chain insulin analog occurs through the side chain amine,hydroxyl, or thiol of the amino acid at position B28 or B29 (accordingto the amino acid numbering of the native insulin A and B chainsequences).

In one embodiment, the single chain insulin analog comprises an aminoacid of Formula I:

In some exemplary embodiments, the amino acid of Formula I, is the aminoacid wherein n is 4 (Lys) or n is 3 (Orn).

In another embodiment, the single chain insulin analog comprises anamino acid of Formula II:

In some exemplary embodiments, the amino acid of Formula II is the aminoacid wherein n is 1 (Ser).

In yet another embodiment, the single chain insulin analog comprises aside chain thiol is an amino acid of Formula III:

In some exemplary embodiments, the amino acid of Formula III is theamino acid wherein n is 1 (Cys).

In yet another embodiment, the single chain insulin analog comprises adisubstituted amino acid comprising the same structure of Formula I,Formula II, or Formula III, except that the hydrogen bonded to the alphacarbon of the amino acid of Formula I, Formula II, or Formula III isreplaced with a second side chain.

In accordance with one embodiment, the acylated single chain insulinanalogs comprise a spacer between the peptide and the acyl group. Insome embodiments, the single chain insulin analog is covalently bound tothe spacer, which is covalently bound to the acyl group. In someexemplary embodiments, the single chain insulin analog is modified tocomprise an acyl group by acylation of an amine, hydroxyl, or thiol of aspacer, which spacer is attached to a side chain of an amino acid atposition B28 or B29 (according to the amino acid numbering of the A or Bchain of native insulin), or at any position of the spacer moiety. Theamino acid of the single chain insulin analog to which the spacer isattached can be any amino acid comprising a moiety which permits linkageto the spacer. For example, an amino acid comprising a side chain —NH₂,—OH, or —COOH (e.g., Lys, Orn, Ser, Asp, or Glu) is suitable.

In some embodiments, the spacer between the single chain insulin analogand the acyl group is an amino acid comprising a side chain amine,hydroxyl, or thiol (or a dipeptide or tripeptide comprising an aminoacid comprising a side chain amine, hydroxyl, or thiol). In someembodiments, the spacer comprises a hydrophilic bifunctional spacer. Ina specific embodiment, the spacer comprises an aminopoly(alkyloxy)carboxylate. In this regard, the spacer can comprise, forexample, NH₂(CH₂CH₂O)_(n)(CH₂)_(m)COOH, wherein m is any integer from 1to 6 and n is any integer from 2 to 12, such as, e.g.,8-amino-3,6-dioxaoctanoic acid, which is commercially available fromPeptides International, Inc. (Louisville, Ky.). In one embodiment, thehydrophilic bifunctional spacer comprises two or more reactive groups,e.g., an amine, a hydroxyl, a thiol, and a carboxyl group or anycombinations thereof. In certain embodiments, the hydrophilicbifunctional spacer comprises a hydroxyl group and a carboxylate. Inother embodiments, the hydrophilic bifunctional spacer comprises anamine group and a carboxylate. In other embodiments, the hydrophilicbifunctional spacer comprises a thiol group and a carboxylate.

In some embodiments, the spacer between peptide the single chain insulinanalog and the acyl group is a hydrophobic bifunctional spacer.Hydrophobic bifunctional spacers are known in the art. See, e.g.,Bioconjugate Techniques, G. T. Hermanson (Academic Press, San Diego,Calif., 1996), which is incorporated by reference in its entirety. Incertain embodiments, the hydrophobic bifunctional spacer comprises twoor more reactive groups, e.g., an amine, a hydroxyl, a thiol, and acarboxyl group or any combinations thereof. In certain embodiments, thehydrophobic bifunctional spacer comprises a hydroxyl group and acarboxylate. In other embodiments, the hydrophobic bifunctional spacercomprises an amine group and a carboxylate. In other embodiments, thehydrophobic bifunctional spacer comprises a thiol group and acarboxylate. Suitable hydrophobic bifunctional spacers comprising acarboxylate and a hydroxyl group or a thiol group are known in the artand include, for example, 8-hydroxyoctanoic acid and 8-mercaptooctanoicacid.

In accordance with certain embodiments the bifunctional spacer can be asynthetic or naturally occurring amino acid comprising an amino acidbackbone that is 3 to 10 atoms in length (e.g., 6-amino hexanoic acid,5-aminovaleric acid, 7-aminoheptanoic acid, and 8-aminooctanoic acid).Alternatively, the spacer can be a dipeptide or tripeptide spacer havinga peptide backbone that is 3 to 10 atoms (e.g., 6 to 10 atoms) inlength. Each amino acid of the dipeptide or tripeptide spacer attachedto the single chain insulin analog can be independently selected fromthe group consisting of: naturally-occurring and/or non-naturallyoccurring amino acids, including, for example, any of the D or L isomersof the naturally-occurring amino acids (Ala, Cys, Asp, Glu, Phe, Gly,His, Ile, Lys, Leu, Met, Asn, Pro, Arg, Ser, Thr, Val, Trp, Tyr), or anyD or L isomers of the non-naturally occurring amino acids selected fromthe group consisting of: β-alanine (β-Ala), N-α-methyl-alanine (Me-Ala),aminobutyric acid (Abu), α-aminobutyric acid (γ-Abu), aminohexanoic acid(ε-Ahx), aminoisobutyric acid (Aib), aminomethylpyrrole carboxylic acid,aminopiperidinecarboxylic acid, aminoserine (Ams),aminotetrahydropyran-4-carboxylic acid, arginine N-methoxy-N-methylamide, β-aspartic acid (βAsp), azetidine carboxylic acid,3-(2-benzothiazolyl)alanine, α-tert-butylglycine,2-amino-5-ureido-n-valeric acid (citrulline, Cit), β-Cyclohexylalanine(Cha), acetamidomethyl-cysteine, diaminobutanoic acid (Dab),diaminopropionic acid (Dpr), dihydroxyphenylalanine (DOPA),dimethylthiazolidine (DMTA), γ-Glutamic acid (γ-Glu), homoserine (Hse),hydroxyproline (Hyp), isoleucine N-methoxy-N-methyl amide,methyl-isoleucine (MeIle), isonipecotic acid (Isn), methyl-leucine(MeLeu), methyl-lysine, dimethyl-lysine, trimethyl-lysine,methanoproline, methionine-sulfoxide (Met(O)), methionine-sulfone(Met(O2)), norleucine (Nle), methyl-norleucine (Me-Nle), norvaline(Nva), ornithine (Orn), para-aminobenzoic acid (PABA), penicillamine(Pen), methylphenylalanine (MePhe), 4-Chlorophenylalanine (Phe(4-CD),4-fluorophenylalanine (Phe(4-F)), 4-nitrophenylalanine (Phe(4-NO2)),4-cyanophenylalanine ((Phe(4-CN)), phenylglycine (Phg),piperidinylalanine, piperidinylglycine, 3,4-dehydroproline,pyrrolidinylalanine, sarcosine (Sar), selenocysteine (Sec),U-Benzyl-phosphoserine, 4-amino-3-hydroxy-6-methylheptanoic acid (Sta),4-amino-5-cyclohexyl-3-hydroxypentanoic acid (ACHPA),4-amino-3-hydroxy-5-phenylpentanoic acid (AHPPA),1,2,3,4,-tetrahydro-isoquinoline-3-carboxylic acid (Tic),tetrahydropyranglycine, thienylalanine (Thi), U-Benzyl-phosphotyrosine,O-Phosphotyrosine, methoxytyrosine, ethoxytyrosine,0-(bis-dimethylamino-phosphono)-tyrosine, tyrosine sulfatetetrabutylamine, methyl-valine (MeVal), 1-amino-1-cyclohexane carboxylicacid (Acx), aminovaleric acid, beta-cyclopropyl-alanine (Cpa),propargylglycine (Prg), allylglycine (Alg),2-amino-2-cyclohexyl-propanoic acid (2-Cha), tertbutylglycine (Tbg),vinylglycine (Vg), 1-amino-1-cyclopropane carboxylic acid (Acp),1-amino-1-cyclopentane carboxylic acid (Acpe), alkylated3-mercaptopropionic acid, 1-amino-1-cyclobutane carboxylic acid (Acb).In some embodiments the dipeptide spacer is selected from the groupconsisting of: Ala-Ala, β-Ala-β-Ala, Leu-Leu, Pro-Pro, γ-aminobutyricacid-γ-aminobutyric acid, and γ-Glu-γ-Glu.

The peptide the single chain insulin analog can be modified to comprisean acyl group by acylation of a long chain alkane. In specific aspects,the long chain alkane comprises an amine, hydroxyl, or thiol group (e.g.octadecylamine, tetradecanol, and hexadecanethiol) which reacts with acarboxyl group, or activated form thereof, of the single chain insulinanalog. The carboxyl group, or activated form thereof, of the singlechain insulin analog can be part of a side chain of an amino acid (e.g.,glutamic acid, aspartic acid) of the single chain insulin analog or canbe part of the peptide backbone.

In certain embodiments, the single chain insulin analog is modified tocomprise an acyl group by acylation of the long chain alkane by a spacerwhich is attached to the single chain insulin analog. In specificaspects, the long chain alkane comprises an amine, hydroxyl, or thiolgroup which reacts with a carboxyl group, or activated form thereof, ofthe spacer. Suitable spacers comprising a carboxyl group, or activatedform thereof, are described herein and include, for example,bifunctional spacers, e.g., amino acids, dipeptides, tripeptides,hydrophilic bifunctional spacers and hydrophobic bifunctional spacers.As used herein, the term “activated form of a carboxyl group” refers toa carboxyl group with the general formula R(C═O)X, wherein X is aleaving group and R is the single chain insulin analog or the spacer.For example, activated forms of a carboxyl groups may include, but arenot limited to, acyl chlorides, anhydrides, and esters. In someembodiments, the activated carboxyl group is an ester with aN-hydroxysuccinimide (NHS) leaving group.

With regard to these aspects of the invention, in which a long chainalkane is acylated by the peptide the single chain insulin analog or thespacer, the long chain alkane may be of any size and can comprise anylength of carbon chain. The long chain alkane can be linear or branched.In certain aspects, the long chain alkane is a C₄ to C₃₀ alkane. Forexample, the long chain alkane can be any of a C₄ alkane, C₆ alkane, C₈alkane, C₁₀ alkane, C₁₂ alkane, C₁₄ alkane, C₁₆ alkane, C₁₈ alkane, C₂₀alkane, C₂₂ alkane, C₂₄ alkane, C₂₆ alkane, C₂₈ alkane, or a C₃₀ alkane.In some embodiments, the long chain alkane comprises a C₈ to C₂₀ alkane,e.g., a C₁₄ alkane, C₁₆ alkane, or a C₁₈ alkane.

In some embodiments, an amine, hydroxyl, or thiol group of the singlechain insulin analog is acylated with a cholesterol acid. In a specificembodiment, the peptide is linked to the cholesterol acid through analkylated des-amino Cys spacer, i.e., an alkylated 3-mercaptopropionicacid spacer. Suitable methods of peptide acylation via amines,hydroxyls, and thiols are known in the art. See, for example, Miller,Biochem Biophys Res Commun 218: 377-382 (1996); Shimohigashi andStammer, Int J Pept Protein Res 19: 54-62 (1982); and Previero et al.,Biochim Biophys Acta 263: 7-13 (1972) (for methods of acylating througha hydroxyl); and San and Silvius, J Pept Res 66: 169-180 (2005) (formethods of acylating through a thiol); Bioconjugate Chem. “ChemicalModifications of Proteins: History and Applications” pages 1, 2-12(1990); Hashimoto et al., Pharmaceutical Res. “Synthesis of PalmitoylDerivatives of Insulin and their Biological Activity” Vol. 6, No: 2 pp.171-176 (1989).

The acyl group of the acylated peptide the single chain insulin analogcan be of any size, e.g., any length carbon chain, and can be linear orbranched. In some specific embodiments of the invention, the acyl groupis a C₄ to C₃₀ fatty acid. For example, the acyl group can be any of aC₄ fatty acid, C₆ fatty acid, C₈ fatty acid, C₁₀ fatty acid, C₁₂ fattyacid, C₁₄ fatty acid, C₁₆ fatty acid, C₁₈ fatty acid, C₂₀ fatty acid,C₂₂ fatty acid, C₂₄ fatty acid, C₂₆ fatty acid, C₂₈ fatty acid, or a C₃₀fatty acid. In some embodiments, the acyl group is a C₈ to C₂₀ fattyacid, e.g., a C₁₄ fatty acid or a C₁₆ fatty acid.

In an alternative embodiment, the acyl group is a bile acid. The bileacid can be any suitable bile acid, including, but not limited to,cholic acid, chenodeoxycholic acid, deoxycholic acid, lithocholic acid,taurocholic acid, glycocholic acid, and cholesterol acid.

The acylated single chain insulin analog described herein can be furthermodified to comprise a hydrophilic moiety. In some specific embodimentsthe hydrophilic moiety can comprise a polyethylene glycol (PEG) chain.The incorporation of a hydrophilic moiety can be accomplished throughany suitable means, such as any of the methods described herein. In someembodiments the acylated single chain analog comprises an amino acidselected from the group consisting of a Cys, Lys, Orn, homo-Cys, orAc-Phe, and the side chain of the amino acid is covalently bonded to ahydrophilic moiety (e.g., PEG). In one embodiment, the acyl group isattached to position A14, A15, B1 (for insulin based B chains), B2 (forIGF-1 based B chains), B10, B22, B28 or B29 (according to the amino acidnumbering of the A and B chains of native insulin), optionally via aspacer comprising Cys, Lys, Orn, homo-Cys, or Ac-Phe.

Alternatively, the acylated single chain insulin analog comprises aspacer, wherein the spacer is both acylated and modified to comprise thehydrophilic moiety. Nonlimiting examples of suitable spacers include aspacer comprising one or more amino acids selected from the groupconsisting of Cys, Lys, Orn, homo-Cys, and Ac-Phe.

Alkylation of the Single Chain Insulin Analog

In some embodiments, the single chain insulin analog is modified tocomprise an alkyl group. The alkyl group can be covalently linkeddirectly to an amino acid of the single chain insulin analog, orindirectly to an amino acid of the single chain insulin analog via aspacer, wherein the spacer is positioned between the amino acid of thesingle chain insulin analog and the alkyl group. The alkyl group can beattached to the single chain insulin analog via an ether, thioether, oramino linkage. For example, the single chain insulin analog may bealkylated at the same amino acid position where a hydrophilic moiety islinked, or at a different amino acid position. Alkylation can be carriedout at any position within the single chain insulin analog, includingfor example in the C-terminal region of the B chain or at a position inthe linking moiety, provided that insulin activity is retained. In aspecific aspect of the invention, the single chain insulin analog ismodified to comprise an alkyl group by direct alkylation of an amine,hydroxyl, or thiol of a side chain of an amino acid of the single chaininsulin analog. In some embodiments, the single chain insulin analog isdirectly alkylated through the side chain amine, hydroxyl, or thiol ofan amino acid. In some specific embodiments of the invention, the directalkylation of the single chain insulin analog occurs through the sidechain amine, hydroxyl, or thiol of the amino acid at position A14, A15,B1 (for insulin based B chains), B2 (for IGF-1 based b chains), B10,B22, B28 or B29 (according to the amino acid numbering of the A and Bchain of native insulin).

In some embodiments, the amino acid of the single chain insulin analogcomprises an amino acid selected from of Formula I, Formula II, andFormula III, and the alkyl group is linked through the amino, hydroxylor thiol group contained in Formula I, Formula II, and Formula III,respectively. In some exemplary embodiments, the amino acid of FormulaI, is the amino acid wherein n is 4 (Lys) or n is 3 (Orn). In someexemplary embodiments, the amino acid of Formula II is the amino acidwherein n is 1 (Ser). In some exemplary embodiments, the amino acid ofFormula II is the amino acid wherein n is 1 (Cys). In yet otherembodiments, the amino acid of peptide the single chain insulin analogcomprising a side chain amine, hydroxyl, or thiol is a disubstitutedamino acid comprising the same structure of Formula I, Formula II, orFormula III, except that the hydrogen bonded to the alpha carbon of theamino acid of Formula I, Formula II, or Formula III is replaced with asecond side chain.

In some embodiments of the invention, the single chain insulin analogcomprises a spacer between the peptide and the alkyl group. In someembodiments, the single chain insulin analog is covalently bound to thespacer, which is covalently bound to the alkyl group. In some exemplaryembodiments, the single chain insulin analog is modified to comprise analkyl group by alkylation of an amine, hydroxyl, or thiol of a spacer,wherein the spacer is attached to a side chain of an amino acid atposition A14, A15, B1 (for insulin based b chains), B2 (for IGF-1 basedB chains), B10, B22, B28 or B29 (according to the amino acid numberingof the A and B chains of native insulin). The amino acid of the singlechain insulin analog to which the spacer is attached can be any aminoacid (e.g., a singly α-substituted amino acid or an α,α-disubstitutedamino acid) comprising a moiety which permits linkage to the spacer. Anamino acid of the single chain insulin analog comprising a side chain—NH₂, —OH, or —COOH (e.g., Lys, Orn, Ser, Asp, or Glu) is suitable. Insome embodiments, the spacer between the peptide the single chaininsulin analog and the alkyl group is an amino acid comprising a sidechain amine, hydroxyl, or thiol or a dipeptide or tripeptide comprisingan amino acid comprising a side chain amine, hydroxyl, or thiol.

In the instance in which the alpha amine is alkylated, the spacer aminoacid can be any amino acid. For example, the spacer amino acid can be ahydrophobic amino acid, e.g., Gly, Ala, Val, Leu, Ile, Trp, Met, Phe,Tyr. Alternatively, the spacer amino acid can be an acidic residue,e.g., Asp and Glu. In exemplary embodiments, the spacer amino acid canbe a hydrophobic amino acid, e.g., Gly, Ala, Val, Leu, Ile, Trp, Met,Phe, Tyr, 6-amino hexanoic acid, 5-aminovaleric acid, 7-aminoheptanoicacid, 8-aminooctanoic acid. Alternatively, the spacer amino acid can bean acidic residue, e.g., Asp and Glu, provided that the alkylationoccurs on the alpha amine of the acidic residue. In the instance inwhich the side chain amine of the spacer amino acid is alkylated, thespacer amino acid is an amino acid comprising a side chain amine, e.g.,an amino acid of Formula I (e.g., Lys or Orn). In this instance, it ispossible for both the alpha amine and the side chain amine of the spaceramino acid to be alkylated, such that the peptide is dialkylated.Embodiments of the invention include such dialkylated molecules.

When alkylation occurs through a hydroxyl group of the amino acid of thespacer, the amino acid or one of the amino acids of the spacer can be anamino acid of Formula II. In a specific exemplary embodiment, the aminoacid is Ser.

When alkylation occurs through a thiol group of the amino acid of thespacer, the amino acid or one of the amino acids of the spacer can be anamino acid of Formula III. In a specific exemplary embodiment, the aminoacid is Cys.

In some embodiments, the spacer comprises a hydrophilic bifunctionalspacer. In a specific embodiment, the spacer comprises an aminopoly(alkyloxy)carboxylate. In this regard, the spacer can comprise, forexample, NH₂(CH₂CH₂O)_(n)(CH₂)_(m)COOH, wherein m is any integer from 1to 6 and n is any integer from 2 to 12, such as, e.g.,8-amino-3,6-dioxaoctanoic acid, which is commercially available fromPeptides International, Inc. (Louisville, Ky.). In some embodiments, thespacer between peptide the single chain insulin analog and the alkylgroup is a hydrophilic bifunctional spacer. In certain embodiments, thehydrophilic bifunctional spacer comprises two or more reactive groups,e.g., an amine, a hydroxyl, a thiol, and a carboxyl group or anycombinations thereof. In certain embodiments, the hydrophilicbifunctional spacer comprises a hydroxyl group and a carboxylate. Inother embodiments, the hydrophilic bifunctional spacer comprises anamine group and a carboxylate. In other embodiments, the hydrophilicbifunctional spacer comprises a thiol group and a carboxylate.

In some embodiments, the spacer between peptide the single chain insulinanalog and the alkyl group is a hydrophobic bifunctional spacer. Incertain embodiments, the hydrophobic bifunctional spacer comprises twoor more reactive groups, e.g., an amine, a hydroxyl, a thiol, and acarboxyl group or any combinations thereof. In certain embodiments, thehydrophobic bifunctional spacer comprises a hydroxyl group and acarboxylate. In other embodiments, the hydrophobic bifunctional spacercomprises an amine group and a carboxylate. In other embodiments, thehydrophobic bifunctional spacer comprises a thiol group and acarboxylate. Suitable hydrophobic bifunctional spacers comprising acarboxylate and a hydroxyl group or a thiol group are known in the artand include, for example, 8-hydroxyoctanoic acid and 8-mercaptooctanoicacid.

The spacer (e.g., amino acid, dipeptide, tripeptide, hydrophilicbifunctional spacer, or hydrophobic bifunctional spacer) is 3 to 10atoms (e.g., 6 to 10 atoms, (e.g., 6, 7, 8, 9, or 10 atoms)) in length.In more specific embodiments, the spacer is about 3 to 10 atoms (e.g., 6to 10 atoms) in length and the alkyl is a C₁₂ to C₁₈ alkyl group, e.g.,C₁₄ alkyl group, C₁₆ alkyl group, such that the total length of thespacer and alkyl group is 14 to 28 atoms, e.g., about 14, 15, 16, 17,18, 19, 20, 21, 22, 23, 24, 25, 26, 27, or 28 atoms. In some embodimentsthe length of the spacer and alkyl is 17 to 28 (e.g., 19 to 26, 19 to21) atoms.

In accordance with one embodiment the bifunctional spacer is a syntheticor non-naturally occurring amino acid comprising an amino acid backbonethat is 3 to 10 atoms in length (e.g., 6-amino hexanoic acid,5-aminovaleric acid, 7-aminoheptanoic acid, and 8-aminooctanoic acid).Alternatively, the spacer can be a dipeptide or tripeptide spacer havinga peptide backbone that is 3 to 10 atoms (e.g., 6 to 10 atoms) inlength. The dipeptide or tripeptide spacer attached to the single chaininsulin analog can be composed of naturally-occurring and/ornon-naturally occurring amino acids, including, for example, any of theamino acids taught herein. In some embodiments the spacer comprises anoverall negative charge, e.g., comprises one or two negatively chargedamino acids. In some embodiments the dipeptide spacer is selected fromthe group consisting of: Ala-Ala, β-Ala-β-Ala, Leu-Leu, Pro-Pro,γ-aminobutyric acid-γ-aminobutyric acid, and γ-Glu-γ-Glu. In oneembodiment the dipeptide spacer is γ-Glu-γ-Glu.

Suitable methods of peptide alkylation via amines, hydroxyls, and thiolsare known in the art. For example, a Williamson ether synthesis can beused to form an ether linkage between the insulin peptide and the alkylgroup. Also, a nucleophilic substitution reaction of the peptide with analkyl halide can result in any of an ether, thioether, or amino linkage.The alkyl group of the alkylated peptide the single chain insulin analogcan be of any size, e.g., any length carbon chain, and can be linear orbranched. In some embodiments of the invention, the alkyl group is a C₄to C₃₀ alkyl. For example, the alkyl group can be any of a C₄ alkyl, C₆alkyl, C₈ alkyl, C₁₀ alkyl, C₁₂ alkyl, C₁₄ alkyl, C₁₆ alkyl, C₁₈ alkyl,C₂₀ alkyl, C₂₂ alkyl, C₂₄ alkyl, C₂₆ alkyl, C₂₈ alkyl, or a C₃₀ alkyl.In some embodiments, the alkyl group is a C₈ to C₂₀ alkyl, e.g., a C₁₄alkyl or a C₁₆ alkyl.

In some specific embodiments, the alkyl group comprises a steroid moietyof a bile acid, e.g., cholic acid, chenodeoxycholic acid, deoxycholicacid, lithocholic acid, taurocholic acid, glycocholic acid, andcholesterol acid.

In some embodiments the single chain insulin analog is modified tocomprise an alkyl group by reacting a nucleophilic, long chain alkanewith the single chain insulin analog, wherein the single chain insulinanalog comprises a leaving group suitable for nucleophilic substitution.In specific aspects, the nucleophilic group of the long chain alkanecomprises an amine, hydroxyl, or thiol group (e.g. octadecylamine,tetradecanol, and hexadecanethiol). The leaving group of the singlechain insulin analog can be part of a side chain of an amino acid or canbe part of the peptide backbone. Suitable leaving groups include, forexample, N-hydroxysuccinimide, halogens, and sulfonate esters.

In certain embodiments, the single chain insulin analog is modified tocomprise an alkyl group by reacting the nucleophilic, long chain alkanewith a spacer, which is attached to the single chain insulin analog,wherein the spacer comprises the leaving group. In specific aspects, thelong chain alkane comprises an amine, hydroxyl, or thiol group. Incertain embodiments, the spacer comprising the leaving group can be anyspacer discussed herein, e.g., amino acids, dipeptides, tripeptides,hydrophilic bifunctional spacers and hydrophobic bifunctional spacersfurther comprising a suitable leaving group.

When a long chain alkane is alkylated by the single chain insulin analogor the spacer, the long chain alkane may be of any size and can compriseany length of carbon chain. The long chain alkane can be linear orbranched. In certain aspects, the long chain alkane is a C₄ to C₃₀alkane. For example, the long chain alkane can be any of a C₄ alkane, C₆alkane, C₈ alkane, C₁₀ alkane, C₁₂ alkane, C₁₄ alkane, C₁₆ alkane, C₁₈alkane, C₂₀ alkane, C₂₂ alkane, C₂₄ alkane, C₂₆ alkane, C₂₈ alkane, or aC₃₀ alkane. In some embodiments the long chain alkane comprises a C₈ toC₂₀ alkane, e.g., a C₁₄ alkane, C₁₆ alkane, or a C₁₈ alkane.

Also, in some embodiments alkylation can occur between the single chaininsulin analog and a cholesterol moiety. For example, the hydroxyl groupof cholesterol can displace a leaving group on the long chain alkane toform a cholesterol-insulin peptide product. The alkylated single chaininsulin analogs described herein can be further modified to comprise ahydrophilic moiety. In some specific embodiments the hydrophilic moietycan comprise a polyethylene glycol (PEG) chain. The incorporation of ahydrophilic moiety can be accomplished through any suitable means, suchas any of the methods described herein. In some embodiments the singlechain insulin analog can comprise an amino acid selected from Cys, Lys,Orn, homo-Cys, or Ac-Phe, wherein the side chain of the amino acid iscovalently bonded to a hydrophilic moiety (e.g., PEG). In someembodiments the alkyl group is attached to position A14, A15, B1 (forinsulin based B chains), B2 (for IGF-1 based B chains), B10, B22, B28 orB29 (according to the amino acid numbering of the A or B chain of nativeinsulin), optionally via a spacer comprising Cys, Lys, Orn, homo-Cys, orAc-Phe, and optionally further comprising a hydrophilic moiety linked tothe side chain of another amino acid. Alternatively, the alkylatedsingle chain insulin analog can comprise a spacer, wherein the spacer isboth alkylated and modified to comprise the hydrophilic moiety.Nonlimiting examples of suitable spacers include a spacer comprising oneor more amino acids selected from the group consisting of Cys, Lys, Orn,homo-Cys, and Ac-Phe.

Conjugates

In some embodiments, the single chain insulin analogs described hereinare glycosylated, amidated, carboxylated, phosphorylated, esterified,N-acylated, cyclized via, e.g., a disulfide bridge, or converted into asalt (e.g., an acid addition salt, a basic addition salt), and/oroptionally dimerized, multimerized, or polymerized, or conjugated. Thepresent disclosure also encompasses conjugates in which the single chaininsulin analog is linked to a heterologous moiety. The conjugationbetween the single chain insulin analog and the heterologous moiety canbe through covalent bonding, non-covalent bonding (e.g. electrostaticinteractions, hydrogen bonds, van der Waals interactions, salt bridges,hydrophobic interactions, and the like), or both types of bonding. Avariety of non-covalent coupling systems may be used, includingbiotin-avidin, ligand/receptor, enzyme/substrate, nucleic acid/nucleicacid binding protein, lipid/lipid binding protein, cellular adhesionmolecule partners; or any binding partners or fragments thereof whichhave affinity for each other. In some aspects, the covalent bonds arepeptide bonds. The conjugation of the single chain insulin analog to theheterologous moiety may be indirect or direct conjugation, the former ofwhich may involve a linker or spacer. Suitable linkers and spacers areknown in the art and include, but not limited to, any of the linkers orspacers described.

As used herein, the term “heterologous moiety” is synonymous with theterm “conjugate moiety” and refers to any molecule (chemical orbiochemical, naturally-occurring or non-coded) which is different fromthe single chain insulin analog to which it is attached. Exemplaryconjugate moieties that can be linked to the single chain insulin analoginclude but are not limited to a heterologous peptide or polypeptide(including for example, a plasma protein), a targeting agent, animmunoglobulin or portion thereof (e.g., variable region, CDR, or Fcregion), a diagnostic label such as a radioisotope, fluorophore orenzymatic label, a polymer including water soluble polymers, or othertherapeutic or diagnostic agents. In some embodiments a conjugate isprovided comprising the single chain insulin analog and a plasmaprotein, wherein the plasma protein is selected from the groupconsisting of albumin, transferin, fibrinogen and globulins. In someembodiments the plasma protein moiety of the conjugate is albumin ortransferin. In one embodiment the heterologous moiety is albumin,including for example, albumins such as human serum albumin (HSA) andrecombinant human albumin (rHA). The conjugate in some embodimentscomprises the single chain insulin analog and one or more of apolypeptide, a nucleic acid molecule, an antibody or fragment thereof, apolymer, a the single chain insulin analoguantum dot, a small molecule,a toxin, a diagnostic agent, a carbohydrate, an amino acid.

Polymer Heterologous Moiety

In some embodiments, the heterologous moiety conjugated to the singlechain insulin analog is a polymer. In some embodiments, the polymer isselected from the group consisting of: polyamides, polycarbonates,polyalkylenes and derivatives thereof including, polyalkylene glycols,polyalkylene oxides, polyalkylene terepthalates, polymers of acrylic andmethacrylic esters, including poly(methyl methacrylate), poly(ethylmethacrylate), poly(butylmethacrylate), poly(isobutyl methacrylate),poly(hexylmethacrylate), poly(isodecyl methacrylate), poly(laurylmethacrylate), poly(phenyl methacrylate), poly(methyl acrylate),poly(isopropyl acrylate), poly(isobutyl acrylate), and poly(octadecylacrylate), polyvinyl polymers including polyvinyl alcohols, polyvinylethers, polyvinyl esters, polyvinyl halides, poly(vinyl acetate), andpolyvinylpyrrolidone, polyglycolides, polysiloxanes, polyurethanes andco-polymers thereof, celluloses including alkyl cellulose, hydroxyalkylcelluloses, cellulose ethers, cellulose esters, nitro celluloses, methylcellulose, ethyl cellulose, hydroxypropyl cellulose, hydroxy-propylmethyl cellulose, hydroxybutyl methyl cellulose, cellulose acetate,cellulose propionate, cellulose acetate butyrate, cellulose acetatephthalate, carboxylethyl cellulose, cellulose triacetate, and cellulosesulphate sodium salt, polypropylene, polyethylenes includingpoly(ethylene glycol), poly(ethylene oxide), and poly(ethyleneterephthalate), and polystyrene.

In some aspects, the polymer is a biodegradable polymer, including asynthetic biodegradable polymer (e.g., polymers of lactic acid andglycolic acid, polyanhydrides, poly(ortho)esters, polyurethanes,poly(butic acid), poly(valeric acid), and poly(lactide-cocaprolactone)),and a natural biodegradable polymer (e.g., alginate and otherpolysaccharides including dextran and cellulose, collagen, chemicalderivatives thereof (substitutions, additions of chemical groups, forexample, alkyl, alkylene, hydroxylations, oxidations, and othermodifications routinely made by those skilled in the art), albumin andother hydrophilic proteins (e.g., zein and other prolamines andhydrophobic proteins)), as well as any copolymer or mixture thereof. Ingeneral, these materials degrade either by enzymatic hydrolysis orexposure to water in vivo, by surface or bulk erosion.

In some aspects, the polymer is a bioadhesive polymer, such as abioerodible hydrogel described by H. S. Sawhney, C. P. Pathak and J. A.Hubbell in Macromolecules, 1993, 26, 581-587, the teachings of which areincorporated herein, polyhyaluronic acids, casein, gelatin, glutin,polyanhydrides, polyacrylic acid, alginate, chitosan, poly(methylmethacrylates), poly(ethyl methacrylates), poly(butylmethacrylate),poly(isobutyl methacrylate), poly(hexylmethacrylate), poly(isodecylmethacrylate), poly(lauryl methacrylate), poly(phenyl methacrylate),poly(methyl acrylate), poly(isopropyl acrylate), poly(isobutylacrylate), and poly(octadecyl acrylate).

In some embodiments, the polymer is a water-soluble polymer or ahydrophilic polymer. Hydrophilic polymers are further described hereinunder “Hydrophilic Heterologous Moieties.” Suitable water-solublepolymers are known in the art and include, for example,polyvinylpyrrolidone, hydroxypropyl cellulose (HPC; Klucel),hydroxypropyl methylcellulose (HPMC; Methocel), nitrocellulose,hydroxypropyl ethylcellulose, hydroxypropyl butylcellulose,hydroxypropyl pentylcellulose, methyl cellulose, ethylcellulose(Ethocel), hydroxyethyl cellulose, various alkyl celluloses andhydroxyalkyl celluloses, various cellulose ethers, cellulose acetate,carboxymethyl cellulose, sodium carboxymethyl cellulose, calciumcarboxymethyl cellulose, vinyl acetate/crotonic acid copolymers,poly-hydroxyalkyl methacrylate, hydroxymethyl methacrylate, methacrylicacid copolymers, polymethacrylic acid, polymethylmethacrylate, maleicanhydride/methyl vinyl ether copolymers, poly vinyl alcohol, sodium andcalcium polyacrylic acid, polyacrylic acid, acidic carboxy polymers,carboxypolymethylene, carboxyvinyl polymers, polyoxyethylenepolyoxypropylene copolymer, polymethylvinylether co-maleic anhydride,carboxymethylamide, potassium methacrylate divinylbenzene co-polymer,polyoxyethyleneglycols, polyethylene glycol, and derivatives, salts, andcombinations thereof.

In one embodiment, the polymer is a polyalkylene glycol, including, forexample, polyethylene glycol (PEG). In some embodiments, theheterologous moiety is a carbohydrate. In some embodiments, thecarbohydrate is a monosaccharide (e.g., glucose, galactose, fructose), adisaccharide (e.g., sucrose, lactose, maltose), an oligosaccharide(e.g., raffinose, stachyose), a polysaccharide (a starch, amylase,amylopectin, cellulose, chitin, callose, laminarin, xylan, mannan,fucoidan, galactomannan.

In some embodiments, the heterologous moiety is a lipid. The lipid, insome embodiments, is a fatty acid, eicosanoid, prostaglandin,leukotriene, thromboxane, N-acyl ethanolamine), glycerolipid (e.g.,mono-, di-, tri-substituted glycerols), glycerophospholipid (e.g.,phosphatidylcholine, phosphatidylinositol, phosphatidylethanolamine,phosphatidylserine), sphingolipid (e.g., sphingosine, ceramide), sterollipid (e.g., steroid, cholesterol), prenol lipid, saccharolipid, or apolyketide, oil, wax, cholesterol, sterol, fat-soluble vitamin,monoglyceride, diglyceride, triglyceride, a phospholipid.

Fc Fusion Heterologous Moiety

As noted above, in some embodiments the single chain insulin analog isconjugated, e.g., fused to an immunoglobulin or portion thereof (e.g.variable region, CDR, or Fc region). Known types of immunoglobulins (Ig)include IgG, IgA, IgE, IgD or IgM. The Fc region is a C-terminal regionof an Ig heavy chain, which is responsible for binding to Fc receptorsthat carry out activities such as recycling (which results in prolongedhalf-life), antibody dependent cell-mediated cytotoxicity (ADCC), andcomplement dependent cytotoxicity (CDC).

For example, according to some definitions the human IgG heavy chain Fcregion stretches from Cys226 to the C-terminus of the heavy chain. The“hinge region” generally extends from Glu216 to Pro230 of human IgG1(hinge regions of other IgG isotypes may be aligned with the IgG1sequence by aligning the cysteines involved in cysteine bonding). The Fcregion of an IgG includes two constant domains, CH2 and CH3. The CH2domain of a human IgG Fc region usually extends from amino acids 231 toamino acid 341. The CH3 domain of a human IgG Fc region usually extendsfrom amino acids 342 to 447. References made to amino acid numbering ofimmunoglobulins or immunoglobulin fragments, or regions, are all basedon Kabat et al. 1991, Sequences of Proteins of Immunological Interest,U.S. Department of Public Health, Bethesda, Md. In a related embodiment,the Fc region may comprise one or more native or modified constantregions from an immunoglobulin heavy chain, other than CH1, for example,the CH2 and CH3 regions of IgG and IgA, or the CH3 and CH4 regions ofIgE.

Suitable conjugate moieties include portions of immunoglobulin sequencethat include the FcRn binding site. FcRn, a salvage receptor, isresponsible for recycling immunoglobulins and returning them tocirculation in blood. The region of the Fc portion of IgG that binds tothe FcRn receptor has been described based on X-ray crystallography(Burmeister et al. 1994, Nature 372:379). The major contact area of theFc with the FcRn is near the junction of the CH2 and CH3 domains.Fc-FcRn contacts are all within a single Ig heavy chain. The majorcontact sites include amino acid residues 248, 250-257, 272, 285, 288,290-291, 308-311, and 314 of the CH2 domain and amino acid residues385-387, 428, and 433-436 of the CH3 domain.

Some conjugate moieties may or may not include FcγR binding site(s).FcγR are responsible for ADCC and CDC. Examples of positions within theFc region that make a direct contact with FcγR are amino acids 234-239(lower hinge region), amino acids 265-269 (B/C loop), amino acids297-299 (C′/E loop), and amino acids 327-332 (F/G) loop (Sondermann etal., Nature 406: 267-273, 2000). The lower hinge region of IgE has alsobeen implicated in the FcRI binding (Henry, et al., Biochemistry 36,15568-15578, 1997). Residues involved in IgA receptor binding aredescribed in Lewis et al., (J Immunol. 175:6694-701, 2005). Amino acidresidues involved in IgE receptor binding are described in Sayers et al.(J Biol Chem. 279(34):35320-5, 2004).

Amino acid modifications may be made to the Fc region of animmunoglobulin. Such variant Fc regions comprise at least one amino acidmodification in the CH3 domain of the Fc region (residues 342-447)and/or at least one amino acid modification in the CH2 domain of the Fcregion (residues 231-341). Mutations believed to impart an increasedaffinity for FcRn include T256A, T307A, E380A, and N434A (Shields et al.2001, J. Biol. Chem. 276:6591). Other mutations may reduce binding ofthe Fc region to FcγRI, FcγRIIA, FcγRIIB, and/or FcγRIIIA withoutsignificantly reducing affinity for FcRn. For example, substitution ofthe Asn at position 297 of the Fc region with Ala or another amino acidremoves a highly conserved N-glycosylation site and may result inreduced immunogenicity with concomitant prolonged half-life of the Fcregion, as well as reduced binding to FcγRs (Routledge et al. 1995,Transplantation 60:847; Friend et al. 1999, Transplantation 68:1632;Shields et al. 1995, J. Biol. Chem. 276:6591). Amino acid modificationsat positions 233-236 of IgG1 have been made that reduce binding to FcγRs(Ward and Ghetie 1995, Therapeutic Immunology 2:77 and Armour et al.1999, Eur. J. Immunol. 29:2613). Some exemplary amino acid substitutionsare described in U.S. Pat. Nos. 7,355,008 and 7,381,408, eachincorporated by reference herein in its entirety.

Hydrophilic Heterologous Moiety

In some embodiments, the single chain insulin analog described herein iscovalently bonded to a hydrophilic moiety. Hydrophilic moieties can beattached to the single chain insulin analog under any suitableconditions used to react a protein with an activated polymer molecule.Any means known in the art can be used, including via acylation,reductive alkylation, Michael addition, thiol alkylation or otherchemoselective conjugation/ligation methods through a reactive group onthe hydrophilic moiety (e.g., an aldehyde, amino, ester, thiol,α-haloacetyl, maleimido or hydrazino group) to a reactive group on thetarget compound (e.g., an aldehyde, amino, ester, thiol, α-haloacetyl,maleimido or hydrazino group). Activating groups which can be used tolink the water soluble polymer to one or more proteins include withoutlimitation sulfone, maleimide, sulfhydryl, thiol, triflate, tresylate,azidirine, oxirane, 5-pyridyl, and alpha-halogenated acyl group (e.g.,alpha-iodo acetic acid, alpha-bromoacetic acid, alpha-chloroaceticacid). If attached to the peptide by reductive alkylation, the polymerselected should have a single reactive aldehyde so that the degree ofpolymerization is controlled. See, for example, Kinstler et al., Adv.Drug. Delivery Rev. 54: 477-485 (2002); Roberts et al., Adv. DrugDelivery Rev. 54: 459-476 (2002); and Zalipsky et al., Adv. DrugDelivery Rev. 16: 157-182 (1995).

The hydrophilic moiety, e.g., polyethylene glycol chain, in accordancewith some embodiments has a molecular weight selected from the range ofabout 500 to about 40,000 Daltons. In some embodiments the polyethyleneglycol chain has a molecular weight selected from the range of about 500to about 5,000 Daltons, or about 1,000 to about 5,000 Daltons. Inanother embodiment the hydrophilic moiety, e.g., polyethylene glycolchain, has a molecular weight of about 10,000 to about 20,000 Daltons.In yet other exemplary embodiments the hydrophilic moiety, e.g.polyethylene glycol chain, has a molecular weight of about 20,000 toabout 40,000 Daltons. Linear or branched hydrophilic polymers arecontemplated. Resulting preparations of conjugates may be essentiallymonodisperse or polydisperse, and may have about 0.5, 0.7, 1, 1.2, 1.5or 2 polymer moieties per peptide.

In some embodiments, the native amino acid of the peptide is substitutedwith an amino acid having a side chain suitable for crosslinking withhydrophilic moieties, to facilitate linkage of the hydrophilic moiety tothe peptide. Exemplary amino acids include Cys, Lys, Orn, homo-Cys, oracetyl phenylalanine (Ac-Phe). In other embodiments, an amino acidmodified to comprise a hydrophilic group is added to the peptide at theN-terminus or C-terminus. In some embodiments, the peptide of theconjugate is conjugated to a hydrophilic moiety, e.g. PEG, via covalentlinkage between a side chain of an amino acid of the peptide and thehydrophilic moiety.

rPEG Heterologous Moiety

In some embodiments, the conjugate comprises a single chain insulinanalog fused to an accessory peptide which is capable of forming anextended conformation similar to chemical PEG (e.g., a recombinant PEG(rPEG) molecule), such as those described in International PatentApplication Publication No. WO2009/023270 and U.S. Patent ApplicationPublication No. US2008/0286808. The rPEG molecule is not polyethyleneglycol. The rPEG molecule in some aspects is a polypeptide comprisingone or more of glycine, serine, glutamic acid, aspartic acid, alanine,or proline. In some aspects, the rPEG is a homopolymer, e.g.,poly-glycine, poly-serine, poly-glutamic acid, poly-aspartic acid,poly-alanine, or poly-proline. In other embodiments, the rPEG comprisestwo types of amino acids repeated, e.g., poly(Gly-Ser), poly(Gly-Glu),poly(Gly-Ala), poly(Gly-Asp), poly(Gly-Pro), poly(Ser-Glu), etc. In someaspects, the rPEG comprises three different types of amino acids, e.g.,poly(Gly-Ser-Glu). In specific aspects, the rPEG increases the half-lifeof the single chain insulin analog. In some aspects, the rPEG comprisesa net positive or net negative charge. The rPEG in some aspects lackssecondary structure. In some embodiments, the rPEG is greater than orequal to 10 amino acids in length and in some embodiments is about 40 toabout 50 amino acids in length. The accessory peptide in some aspects isfused to the N- or C-terminus of the peptide of the invention through apeptide bond or a proteinase cleavage site. The rPEG in some aspectscomprises an affinity tag or is linked to a PEG that is greater than 5kDa. In some embodiments, the rPEG confers the conjugate of theinvention with an increased hydrodynamic radius, serum half-life,protease resistance, or solubility and in some aspects confers theconjugate with decreased immunogenicity.

The conjugate moieties can be linked to the single chain insulin analogvia direct covalent linkage by reacting targeted amino acid residues ofthe peptide with an organic derivatizing agent that is capable ofreacting with selected side chains or the N- or C-terminal residues ofthese targeted amino acids. Reactive groups on the peptide or conjugatemoiety include, e.g., an aldehyde, amino, ester, thiol, α-haloacetyl,maleimido or hydrazino group. Derivatizing agents include, for example,maleimidobenzoyl sulfosuccinimide ester (conjugation through cysteineresidues), N-hydroxysuccinimide (through lysine residues),glutaraldehyde, succinic anhydride or other agents known in the art.Alternatively, the conjugate moieties can be linked to the peptideindirectly through intermediate carriers, such as polysaccharide orpolypeptide carriers. Examples of polysaccharide carriers includeaminodextran. Examples of suitable polypeptide carriers includepolylysine, polyglutamic acid, polyaspartic acid, co-polymers thereof,and mixed polymers of these amino acids and others, e.g., serines, toconfer desirable solubility properties on the resultant loaded carrier.

Multimers

The single chain insulin analog may be part of a dimer, trimer or higherorder multimer comprising at least two, three, or more peptides boundvia a linker, wherein at least one or both peptides is a single chaininsulin analog. In one embodiment a single chain insulin analog islinked to either the A chain or the B chain of a second insulinpolypeptide that is either a heteroduplex comprising the A and B chainor a second single chain insulin analog. The dimer may be a homodimer orheterodimer. In some embodiments, the linker is selected from the groupconsisting of a bifunctional thiol crosslinker and a bi-functional aminecrosslinker. In certain embodiments, the linker is PEG, e.g., a 5 kDaPEG, 20 kDa PEG. In some embodiments, the linker is a disulfide bond.For example, each monomer of the dimer may comprise a Cys residue (e.g.,a terminal or internally positioned Cys) and the sulfur atom of each Cysresidue participates in the formation of the disulfide bond. In someaspects of the invention, the monomers are connected via terminal aminoacids (e.g., N-terminal or C-terminal), via internal amino acids, or viaa terminal amino acid of at least one monomer and an internal amino acidof at least one other monomer. In specific aspects, the monomers are notconnected via an N-terminal amino acid. In some aspects, the monomers ofthe multimer are attached together in a “tail-to-tail” orientation inwhich the C-terminal amino acids of each monomer are attached together.In one embodiment the dimer comprises two single chain insulin analogswherein the two insulin analogs are linked to one another via the aminoacid side chains of an amino acid present in the linking moiety of eachsingle chain insulin analog. A conjugate moiety may be covalently linkedto any of the single chain insulin analogs described herein, including adimer, trimer or higher order multimer.

In accordance with one embodiment a multimer is provided that comprisesan IGF YL B chain analog disclosed herein (including prodrug and depotderivatives thereof). The multimer (e.g., a dimer) may be a homodimer orheterodimer, comprising peptides selected from the group consisting ofnative insulin, native IGF-1, native IGF-II, an insulin analog peptideand IGF analog peptides. In some embodiments, the linker is selectedfrom the group consisting of a bifunctional thiol crosslinker and abi-functional amine crosslinker. In certain embodiments, the linker isPEG, e.g., a 5 kDa PEG, 20 kDa PEG. In some embodiments, the linker is adisulfide bond.

Controlled Release Formulations

Alternatively, the single chain insulin analogs described herein can bemodified into a depot form, such that the manner in which the conjugateof the present disclosure is released into the body to which it isadministered is controlled with respect to time and location within thebody (see, for example, U.S. Pat. No. 4,450,150). Depot forms of theconjugates of the present disclosures can be, for example, animplantable composition comprising the conjugate of the presentdisclosure and a porous or non-porous material, such as a polymer,wherein the conjugate of the present disclosures is encapsulated by ordiffused throughout the material and/or degradation of the non-porousmaterial. The depot is then implanted into the desired location withinthe body and the conjugate of the present disclosures are released fromthe implant at a predetermined rate.

Alternatively, a large depot polymer can be linked to a self cleavingdipeptide element that is covalently bound to the single chain insulinanalog as described herein. In this embodiment, the depot polymereffectively sequesters the single chain insulin analog at its site ofadministration until it is subsequently cleaved from the single chainanalog via a non-enzymatic reaction at a predetermined rate. Depotformulations of insulin analogs using a self cleaving dipeptide havebeen described in PCT/US2009/068713, the disclosure of which isincorporated herein. In one embodiment a single chain insulin analog isprovided comprising a dipeptide prodrug element wherein the dipeptideprodrug element is linked to a large polymer such as PEG or dextran. Inone embodiment a self cleaving dipeptide element comprising a largedepot polymer (including for example, PEG) is linked to the side chainof an amino acid of the linking moiety, including for example, the aminoacid at position C8 of the linking moiety.

Pharmaceutical compositions can be prepared that comprise the singlechain analogs and are formulated to have a desired in vivo releaseprofile. In some aspects, the pharmaceutical composition is an immediaterelease, controlled release, sustained release, extended release,delayed release, or bi-phasic release formulation. Methods offormulating peptides or conjugates for controlled release are known inthe art. See, for example, J Pharm 374: 46-52 (2009) and InternationalPatent Application Publication Nos. WO 2008/130158, WO2004/033036;WO2000/032218; and WO 1999/040942. The instant compositions may furthercomprise, for example, micelles or liposomes, or some other encapsulatedform, or may be administered in an extended release form to provide aprolonged storage and/or delivery effect. The disclosed pharmaceuticalformulations may be administered according to any regime including, forexample, daily (1 time per day, 2 times per day, 3 times per day, 4times per day, 5 times per day, 6 times per day), every two days, everythree days, every four days, every five days, every six days, weekly,bi-weekly, every three weeks, monthly, or bi-monthly.

Prodrug Derivatives of Single Chain Insulin Analogs

The present disclosure also encompasses prodrug analogs of the singlechain insulin analog peptides disclosed herein. Advantageously, theprodrug formulations improve the therapeutic index of the underlyingpeptide and delay onset of action and enhance the half life of thesingle chain insulin analog peptide. The disclosed prodrug chemistry canbe chemically conjugated to active site amines to form amides thatrevert to the parent amine upon diketopiperazine formation and releaseof the prodrug element (see International patent applicationPCT/US2009/068713, the disclosure of which is expressly incorporatedherein). This novel biologically friendly prodrug chemistryspontaneously degrades under physiological conditions (e.g. pH of about7, at 37° C. in an aqueous environment) and is not reliant on enzymaticdegradation. The duration of the prodrug analog is determined by theselection of the dipeptide prodrug sequence, and thus allows forflexibility in prodrug formulation.

In one embodiment a prodrug is provided having a non-enzymaticactivation half time (t½) of between 1-100 hrs under physiologicalconditions. Physiological conditions as disclosed herein are intended toinclude a temperature of about 35 to 40° C. and a pH of about 7.0 toabout 7.4 and more typically include a pH of 7.2 to 7.4 and atemperature of 36 to 38° C. in an aqueous environment. In one embodimenta dipeptide, capable of undergoing diketopiperazine formation underphysiological conditions, is covalently linked through an amide or esterlinkage to the single chain insulin analog (see Internationalapplications WO 2009/099763 and PCT/US2009/068713 the disclosures ofwhich are incorporated herein).

Advantageously, the rate of cleavage, and thus activation of theprodrug, depends on the structure and stereochemistry of the dipeptidepro-moiety and also on the strength of the nucleophile. The prodrugsdisclosed herein will ultimately be chemically converted to structuresthat can be recognized by the insulin/IGF receptor, wherein the speed ofthis chemical conversion will determine the time of onset and durationof in vivo biological action. The prodrug chemistry disclosed in thisapplication relies upon an intramolecular chemical reaction that is notdependent upon additional chemical additives, or enzymes. The speed ofconversion is controlled by the chemical nature of the dipeptidesubstituent and its cleavage under physiological conditions. Sincephysiological pH and temperature are tightly regulated within a highlydefined range, the speed of conversion from prodrug to drug will exhibithigh intra and interpatient reproducibility. In one embodiment a prodrugderivative of the single chain analog is provided wherein the singlechain insulin analog comprises a linking moiety of 8-17 amino acidswherein one of the amino acids of the linking moiety is pegylated.Alternatively, or in addition to the pegylation of the amino acid of thelinking moiety, one of the two amino acids of the dipeptide prodrugelement can be pegylated. Alternatively, or in any combination with theabove mentioned pegylated sites, the single chain insulin prodrugderivative can be pegylated at a position selected from the groupconsisting of A9, A14 and A15 of the A chain, or positions B1 (forinsulin based B chains), B2 (for IGF-1 based B chains), B10, B22, B28 orB29 of the B chain. In one embodiment the insulin prodrug is pegylatedat the N-terminal alpha amine of the B chain (e.g. at position B1 forinsulin based B chain or position B2 for IGF-1 based B chain).

As disclosed herein prodrugs are provided wherein the single chaininsulin analog peptides have extended half lives of at least 1 hour, andmore typically greater than 20 hours but less than 100 hours, and areconverted to the active form at physiological conditions through anon-enzymatic reaction driven by inherent chemical instability. In oneembodiment the a non-enzymatic activation t½ time of the prodrug isbetween 1-100 hrs, and more typically between 12 and 72 hours, and inone embodiment the t½ is between 24-48 hrs as measured by incubating theprodrug in a phosphate buffer solution (e.g., PBS) at 37° C. and pH of7.2. In one embodiment the half life of the prodrugs is about 1, 8, 12,20, 24, 48 or 72 hours. In one embodiment the half life of the prodrugsis about 100 hours or greater including half lives of up to about 168,336, 504, 672 or 720 hours, and are converted to the active form atphysiological conditions through a non-enzymatic reaction driven byinherent chemical instability. The half lives of the various prodrugsare calculated by using the formula t_(1/2)=0.693/k, where l′ is thefirst order rate constant for the degradation of the prodrug. In oneembodiment, activation of the prodrug occurs after cleavage of an amidebond linked dipeptide, and formation of a diketopiperazine ordiketomorpholine, and the active single chain insulin analog peptide.

In another embodiment, the dipeptide prodrug element is covalently boundto the single chain insulin analog peptide via an amide linkage, and thedipeptide further comprises a depot polymer linked to dipeptide. In oneembodiment two or more depot polymers are linked to a single dipeptideelement. In one embodiment the depot polymer is linked to the side chainof one of the amino acids comprising the dipeptide prodrug element. Thedepot polymer is selected to be biocompatible and of sufficient sizethat the single chain insulin analog, modified by covalent attachment ofthe dipeptide, remains sequestered at an injection site and/or incapableof interacting with its corresponding receptor upon administration to apatient. Subsequent cleavage of the dipeptide releases the single chaininsulin analog to interact with its intended target. The depot bearingdipeptide element can be linked to the single chain insulin analog viaan amide bond through any convenient amine group of the single chaininsulin analog, including an N-terminal amine or an amine bearing sidechain of an internal natural or synthetic amino acid of the single chaininsulin analog. In one embodiment the depot bearing dipeptide element islinked to the amino group of a 4-amino phenylalanine present at positionA19 of the single chain analog.

In accordance with one embodiment the depot polymer is selected frombiocompatible polymers known to those skilled in the art. The depotpolymers typically have a size selected from a range of about 20,000 to120,000 Daltons. In one embodiment the depot polymer has a size selectedfrom a range of about 40,000 to 100,000 or about 40,000 to 80,000Daltons. In one embodiment the depot polymer has a size of about 40,000,50,000, 60,000, 70,000 or 80,000 Daltons. Suitable depot polymersinclude but are not limited to dextrans, polylactides, polyglycolides,caprolactone-based polymers, poly(caprolactone), polyanhydrides,polyamines, polyesteramides, polyorthoesters, polydioxanones,polyacetals, polyketals, polycarbonates, polyphosphoesters, polyesters,polybutylene terephthalate, polyorthocarbonates, polyphosphazenes,succinates, poly(malic acid), poly(amino acids), polyvinylpyrrolidone,polyethylene glycol, polyhydroxycellulose, polysaccharides, chitin,chitosan, hyaluronic acid, and copolymers, terpolymers and mixturesthereof, and biodegradable polymers and their copolymers includingcaprolactone-based polymers, polycaprolactones and copolymers whichinclude polybutylene terephthalate. In one embodiment the depot polymeris selected from the group consisting of polyethylene glycol, dextran,polylactic acid, polyglycolic acid and a copolymer of lactic acid andglycolic acid, and in one specific embodiment the depot polymer ispolyethylene glycol. In one embodiment the depot polymer is polyethyleneglycol and the combined molecular weight of depot polymer(s) linked tothe dipeptide element is about 40,000 to 80,000 Daltons.

Specific dipeptides composed of natural or synthetic amino acids havebeen identified that facilitate intramolecular decomposition underphysiological conditions to release the active single chain insulinanalog. The dipeptide can be linked (via an amide bond) to an aminogroup present on the single chain insulin analog, or an amino groupintroduced into the single chain insulin analog by modification of thepeptide sequence. In one embodiment the dipeptide structure is selectedto resist cleavage by peptidases present in mammalian sera, includingfor example dipeptidyl peptidase IV (DPP-IV). Accordingly, in oneembodiment the rate of cleavage of the dipeptide prodrug element fromthe bioactive peptide is not substantially enhanced (e.g., greater than2×) when the reaction is conducted using physiological conditions in thepresence of serum proteases relative to conducting the reaction in theabsence of the proteases. Thus the cleavage half-life of the dipeptideprodrug element from the single chain insulin analog (in PBS underphysiological conditions) is not more than two, three, four or five foldthe cleavage half-life of the dipeptide prodrug element from the singlechain insulin analog in a solution comprising a DPP-IV protease. In oneembodiment the solution comprising a DPP-IV protease is serum, moreparticularly mammalian serum, including human serum.

In accordance with one embodiment the dipeptide prodrug elementcomprises the structure U-B, wherein U is an amino acid or a hydroxylacid and B is an N-alkylated amino acid. The structure of U-B isselected, in one embodiment, wherein chemical cleavage of U-B from thesingle chain insulin analog is at least about 90% complete within about1 to about 720 hours in PBS under physiological conditions. In oneembodiment the chemical cleavage half-life (t_(1/2)) of U-B from thesingle chain insulin analog peptide is at least about 1 hour to about 1week in PBS under physiological conditions. In one embodiment U, B, orthe amino acid of the single chain insulin analog to which U-B is linkedis a non-coded amino acid. In some embodiments U and/or B is an aminoacid in the D stereoisomer configuration. In some exemplary embodiments,U is an amino acid in the D stereoisomer configuration and B is an aminoacid in the L stereoisomer configuration. In some exemplary embodiments,U is an amino acid in the L stereoisomer configuration and B is an aminoacid in the D stereoisomer configuration. In some exemplary embodiments,U is an amino acid in the D stereoisomer configuration and B is an aminoacid in the D stereoisomer configuration. In one embodiment B is anN-alkylated amino acid but is not proline. In one embodiment theN-alkylated group of amino acid B is a C₁-C₁₈ alkyl, and in oneembodiment the N-alkylated group is C₁-C₆ alkyl. In one embodiment U isan amino acid having a disubstitution at the alpha carbon.

In one embodiment one or more dipeptide elements are linked to singlechain insulin analog through an amide bond formed through one or moreamino groups selected from the N-terminal amino group of the B chain, orthe side chain amino group of an amino acid present in the single chaininsulin analog. In one embodiment the single chain insulin analogcomprises two dipeptide elements, wherein the dipeptide elements areoptionally pegylated, alkylated, acylated or linked to a depot polymer.In accordance with one embodiment the dipeptide extension is covalentlylinked to a single chain insulin analog through the side chain amine ofa lysine residue that resides at or near the active site. In oneembodiment the dipeptide extension is attached through a synthetic aminoacid or a modified amino acid, wherein the synthetic amino acid ormodified amino acid exhibits a functional group suitable for covalentattachment of the dipeptide extension (e.g., the aromatic amine of anamino-phenylalanine). In accordance with one embodiment one or moredipeptide elements are linked to the single chain insulin analog at anamino group selected from the N-terminal amino group of the B chain, orthe side chain amino group of an aromatic amine of a4-amino-phenylalanine residue present at a position corresponding toposition A19, B16 or B25 of native insulin.

The dipeptide prodrug element is designed to spontaneously cleave itsamide linkage to the insulin analog under physiological conditions andin the absence of enzymatic activity. In one embodiment the N-terminalamino acid of the dipeptide prodrug element comprises a C-alkylatedamino acid (e.g. amino isobutyric acid). In one embodiment theC-terminal amino acid of the dipeptide prodrug element comprises anN-alkylated amino acid (e.g., proline or N-methyl glycine). In oneembodiment the dipeptide comprises the sequence of an N-terminalC-alkylated amino acid followed by an N-alkylated amino acid.

Applicants have discovered that the selective insertion of a 4-aminophenylalanine amino acid moiety for the native tyrosine at position 19of the A chain can be accommodated without loss in potency of theinsulin peptide (see FIG. 3). Subsequent chemical amidation of thisactive site amino group with the dipeptide prodrug element disclosedherein dramatically lessens insulin receptor binding activity and thusprovides a suitable prodrug of insulin (see FIG. 7-12, data provided forthe IGF1Y¹⁶L¹⁷ (p-NH₂—F)^(A19) analog which has been demonstrated tohave comparable activity as insulin (p-NH₂—F)^(A19), see FIG. 4).Applicants have discovered that a similar modification can be made tothe IGF^(B16B17) analog peptides to provide a suitable attachment sitefor prodrug chemistry. Accordingly, in one embodiment the dipeptideprodrug element is linked to the aromatic ring of an A194-aminophenylalanine of an insulin (p-NH₂—F)^(A19) or IGF^(B16B17)single chain insulin analog peptide via an amide bond, wherein theC-terminal amino acid of the dipeptide comprises an N-alkylated aminoacid and the N-terminal amino acid of the dipeptide is any amino acid.

The dipeptide prodrug moiety can also be attached to additional sites ofan insulin (p-NH₂—F)^(A19) or IGF^(B16B17) single chain insulin analogpeptide to prepare insulin (p-NH₂—F)^(A19) or IGF^(B16B17) single chaininsulin analog prodrug derivatives. In accordance with one embodiment anIGF^(B16B17) single chain insulin analog prodrug derivative is providedcomprising an IGF^(B16B17) B chain with a dipeptide prodrug elementlinked via an amide bond to the N-terminal amino group of the B chain,or the side chain amino group of an aromatic amine of a4-amino-phenylalanine residue present at a position corresponding toA19, B16 or B25 of native insulin or present in the linking moiety. TheIGF^(B16B17) single chain insulin analog prodrug derivative may comprisea native insulin A chain or a native IGF-1 A chain or any analogsthereof disclosed herein. In one embodiment the dipeptide comprises anN-terminal C-alkylated amino acid followed by an N-alkylated amino acid.In accordance with one embodiment a single chain insulin analog prodrugderivative is provided wherein the A chain and B chain comprising theanalog comprises the sequence of SEQ ID NO: 1 and SEQ ID NO: 2,respectively, or may comprise an analog of SEQ ID NO: 1 and/or SEQ IDNO: 2 wherein the analogs include substitution of the amino acid atposition A19, B16 or B25 with a 4-amino phenylalanine and/or one or moreamino acid substitutions at positions corresponding to positions A5, A8,A9, A10, A14, A15, A17, A18, A19 and A21, B1, B2, B3, B4, B5, B9, B10,B13, B14, B20, B22, B23, B26, B27, B28, B29 and B30 of native insulin,or deletions of any or all of corresponding positions B1-4 and B26-30,relative to native insulin. In one embodiment the dipeptide is linked toan N-terminal amino group of the B chain of the single chain insulinanalog, wherein the C-terminal amino acid of the dipeptide comprises anN-alkylated amino acid and the N-terminal amino acid of the dipeptide isany amino acid, with the proviso that when the C-terminal amino acid ofthe dipeptide is proline, the N-terminal amino acid of the dipeptidecomprises a C-alkylated amino acid.

In one embodiment the dipeptide prodrug element comprises the generalstructure of Formula X:

wherein

R₁, R₂, R₄ and R₈ are independently selected from the group consistingof H, C₁-C₁₈ alkyl, C₂-C₁₈ alkenyl, (C₁-C₁₈ alkyl)OH, (C₁-C₁₈ alkyl)SH,(C₂-C₃ alkyl)SCH₃, (C₁-C₄ alkyl)CONH₂, (C₁-C₄ alkyl)COOH, (C₁-C₄alkyl)NH₂, (C₁-C₄ alkyl)NHC(NH₂ ⁺)NH₂, (C₀-C₄ alkyl)(C₃-C₆ cycloalkyl),(C₀-C₄ alkyl)(C₂-C₅ heterocyclic), (C₀-C₄ alkyl)(C₆-C₁₀ aryl)R₇, (C₁-C₄alkyl)(C₃-C₉ heteroaryl), and C₁-C₁₂ alkyl(W)C₁-C₁₂ alkyl, wherein W isa heteroatom selected from the group consisting of N, S and O, or R₁ andR₂ together with the atoms to which they are attached form a C₃-C₁₂cycloalkyl or aryl; or R₄ and R₈ together with the atoms to which theyare attached form a C₃-C₆ cycloalkyl;

R₃ is selected from the group consisting of C₁-C₁₈ alkyl, (C₁-C₁₈alkyl)OH, (C₁-C₁₈ alkyl)NH₂, (C₁-C₁₈ alkyl)SH, (C₀-C₄alkyl)(C₃-C₆)cycloalkyl, (C₀-C₄ alkyl)(C₂-C₅ heterocyclic), (C₀-C₄alkyl)(C₆-C₁₀ aryl)R₇, and (C₁-C₄ alkyl)(C₃-C₉ heteroaryl) or R₄ and R₃together with the atoms to which they are attached form a 4, 5 or 6member heterocyclic ring;

R₅ is NHR₆ or OH;

R₆ is H, C₁-C₈ alkyl or R₆ and R₂ together with the atoms to which theyare attached form a 4, 5 or 6 member heterocyclic ring; and

R₇ is selected from the group consisting of H and OH. In one embodimentwhen the prodrug element is linked to the N-terminal amine of the singlechain insulin analog and R₄ and R₃ together with the atoms to which theyare attached form a 4, 5 or 6 member heterocyclic ring, then at leastone of R₁ and R₂ are other than H.

In one embodiment the prodrug element of Formula X is provided whereinR₁ is selected from the group consisting of H and C₁-C₈ alkyl; and

-   -   R₂, R₈ and R₄ are independently selected from the group        consisting of H, C₁-C₈ alkyl, C₂-C₈ alkenyl, (C₁-C₄ alkyl)OH,        (C₁-C₄ alkyl)SH, (C₂-C₃ alkyl)SCH₃, (C₁-C₄ alkyl)CONH₂, (C₁-C₄        alkyl)COOH, (C₁-C₄ alkyl)NH₂, (C₁-C₄ alkyl)NHC(NH₂ ⁺) NH₂,        (C₀-C₄ alkyl)(C₃-C₆ cycloalkyl), (C₀-C₄ alkyl)(C₆-C₁₀ aryl)R₇,        and CH₂(C₅-C₉ heteroaryl), or R₁ and R₂ together with the atoms        to which they are attached form a C₃-C₈ cycloalkyl ring;    -   R₃ is selected from the group consisting of C₁-C₈ alkyl, (C₁-C₄        alkyl)OH, (C₁-C₄ alkyl)SH, (C₁-C₄ alkyl)NH₂, (C₃-C₆)cycloalkyl        or R₄ and R₃ together with the atoms to which they are attached        form a 5 or 6 member heterocyclic ring;    -   R₅ is NHR₆ or OH;    -   R₆ is H, or R₆ and R₂ together with the atoms to which they are        attached form a 5 or 6 member heterocyclic ring; and    -   R₇ is selected from the group consisting of H and OH and R₈        is H. In one embodiment R₃ is C₁-C₈ alkyl and R₄ is selected        from the group consisting of H, C₁-C₆ alkyl, CH₂OH, (C₀-C₄        alkyl)(C₆-C₁₀ aryl)R₇, and CH₂(C₅-C₉ heteroaryl) or R₄ and R₃        together with the atoms to which they are attached form a 5 or 6        member heterocyclic ring. In a further embodiment R₅ is NHR₆ and        R₈ is H.

In accordance with one embodiment the dipeptide element comprises acompound having the general structure of Formula X:

wherein

R₁, R₂, R₄ and R₈ are independently selected from the group consistingof H, C₁-C₁₈ alkyl, C₂-C₁₈ alkenyl, (C₁-C₁₈ alkyl)OH, (C₁-C₁₈ alkyl)SH,(C₂-C₃ alkyl)SCH₃, (C₁-C₄ alkyl)CONH₂, (C₁-C₄ alkyl)COOH, (C₁-C₄alkyl)NH₂, (C₁-C₄ alkyl)NHC(NH₂ ⁺)NH₂, (C₀-C₄ alkyl)(C₃-C₆ cycloalkyl),(C₀-C₄ alkyl)(C₂-C₅ heterocyclic), (C₀-C₄ alkyl)(C₆-C₁₀ aryl)R₇, (C₁-C₄alkyl)(C₃-C₉ heteroaryl), and C₁-C₁₂ alkyl(W₁)C₁-C₁₂ alkyl, wherein W₁is a heteroatom selected from the group consisting of N, S and O, or R₁and R₂ together with the atoms to which they are attached form a C₃-C₁₂cycloalkyl; or R₄ and R₈ together with the atoms to which they areattached form a C₃-C₆ cycloalkyl;

R₃ is selected from the group consisting of C₁-C₁₈ alkyl, (C₁-C₁₈alkyl)OH, (C₁-C₁₈ alkyl)NH₂, (C₁-C₁₈ alkyl)SH, (C₀-C₄alkyl)(C₃-C₆)cycloalkyl, (C₀-C₄ alkyl)(C₂-C₅ heterocyclic), (C₀-C₄alkyl)(C₆-C₁₀ aryl)R₇, and (C₁-C₄ alkyl)(C₃-C₉ heteroaryl) or R₄ and R₃together with the atoms to which they are attached form a 4, 5 or 6member heterocyclic ring;

R₅ is NHR₆ or OH;

R₆ is H, C₁-C₈ alkyl or R₆ and R₁ together with the atoms to which theyare attached form a 4, 5 or 6 member heterocyclic ring; and

R₇ is selected from the group consisting of H, OH, C₁-C₁₈ alkyl, C₂-C₁₈alkenyl, (C₀-C₄ alkyl)CONH₂, (C₀-C₄ alkyl)COOH, (C₀-C₄ alkyl)NH₂, (C₀-C₄alkyl)OH, and halo.

In another embodiment the dipeptide prodrug element comprises thegeneral structure:

wherein

R₁ and R₈ are independently H or C₁-C₈ alkyl;

R₂ and R₄ are independently selected from the group consisting of H,C₁-C₈ alkyl, C₂-C₈ alkenyl, (C₁-C₄ alkyl)OH, (C₁-C₄ alkyl)SH, (C₂-C₃alkyl)SCH₃, (C₁-C₄ alkyl)CONH₂, (C₁-C₄ alkyl)COOH, (C₁-C₄ alkyl)NH₂,(C₁-C₄ alkyl)NHC(NH₂+) NH₂, (C₀-C₄ alkyl)(C₃-C₆ cycloalkyl), (C₀-C₄alkyl)(C₂-C₅ heterocyclic), (C₀-C₄ alkyl)(C₆-C₁₀ aryl)R₇, and CH₂(C₃-C₉heteroaryl), or R₁ and R₂ together with the atoms to which they areattached form a C₃-C₁₂ cycloalkyl;

R₃ is selected from the group consisting of C₁-C₈ alkyl, (C₁-C₄alkyl)OH, (C₁-C₄ alkyl)NH₂, (C₁-C₄ alkyl)SH, (C₃-C₆)cycloalkyl or R₄ andR₃ together with the atoms to which they are attached form a 5 or 6member heterocyclic ring;

R₅ is NHR₆ or OH;

R₆ is H, C₁-C₈ alkyl, or R₆ and R₂ together with the atoms to which theyare attached form a 5 or 6 member heterocyclic ring; and

R₇ is selected from the group consisting of H, OH, C₁-C₁₈ alkyl, C₂-C₁₈alkenyl, (C₀-C₄ alkyl)CONH₂, (C₀-C₄ alkyl)COOH, (C₀-C₄ alkyl)NH₂, (C₀-C₄alkyl)OH and halo, provided that when R₄ and R₃ together with the atomsto which they are attached form a 5 or 6 member heterocyclic ring, bothR₁ and R₂ are not H. In one embodiment R₇ is H or OH. In one embodimenteither the first amino acid and/or the second amino acid of thedipeptide prodrug element is an amino acid in the D stereoisomerconfiguration.

In a further embodiment the prodrug element of Formula X is providedwherein

-   -   R₁ is selected from the group consisting of H and C₁-C₈ alkyl;        and    -   R₂ and R₄ are independently selected from the group consisting        of H, C₁-C₈ alkyl, C₂-C₈ alkenyl, (C₁-C₄ alkyl)OH, (C₁-C₄        alkyl)SH, (C₂-C₃ alkyl)SCH₃, (C₁-C₄ alkyl)CONH₂, (C₁-C₄        alkyl)COOH, (C₁-C₄ alkyl)NH₂, (C₁-C₄ alkyl)NHC(NH₂ ⁺) NH₂,        (C₀-C₄ alkyl)(C₃-C₆ cycloalkyl), (C₀-C₄ alkyl)(C₆-C₁₀ aryl)R₇,        and CH₂(C₅-C₉ heteroaryl), or R₁ and R₂ together with the atoms        to which they are attached form a C₃-C₈ cycloalkyl ring;    -   R₃ is selected from the group consisting of C₁-C₈ alkyl, (C₁-C₄        alkyl)OH, (C₁-C₄ alkyl)SH, (C₁-C₄ alkyl)NH₂, (C₃-C₆)cycloalkyl        or R₄ and R₃ together with the atoms to which they are attached        form a 5 or 6 member heterocyclic ring;    -   R₅ is NHR₆ or OH;    -   R₆ is H, or R₆ and R₂ together with the atoms to which they are        attached form a 5 or 6 member heterocyclic ring;    -   R₇ is selected from the group consisting of hydrogen, C₁-C₁₈        alkyl, C₂-C₁₈ alkenyl, (C₀-C₄ alkyl)CONH₂, (C₀-C₄ alkyl)COOH,        (C₀-C₄ alkyl)NH₂, (C₀-C₄ alkyl)OH and halo, and R₈ is H,        provided that when the dipeptide element is linked to an        N-terminal amine and R₄ and R₃ together with the atoms to which        they are attached form a 5 or 6 member heterocyclic ring, both        R₁ and R₂ are not H. In one embodiment either the first amino        acid and/or the second amino acid of the dipeptide prodrug        element is an amino acid in the D stereoisomer configuration.

In other embodiments the dipeptide prodrug element has the structure ofFormula X, wherein

R₁ and R₈ are independently H or C₁-C₈ alkyl;

R₂ and R₄ are independently selected from the group consisting of H,C₁-C₈ alkyl, C₂-C₈ alkenyl, (C₁-C₄ alkyl)OH, (C₁-C₄ alkyl)SH, (C₂-C₃alkyl)SCH₃, (C₁-C₄ alkyl)CONH₂, (C₁-C₄ alkyl)COOH, (C₁-C₄ alkyl)NH₂,(C₁-C₄ alkyl)NHC(NH₂+) NH₂, (C₀-C₄ alkyl)(C₃-C₆ cycloalkyl), (C₀-C₄alkyl)(C₂-C₅ heterocyclic), (C₀-C₄ alkyl)(C₆-C₁₀ aryl)R₇, and CH₂(C₃-C₉heteroaryl), or R₁ and R₂ together with the atoms to which they areattached form a C₃-C₁₂ cycloalkyl;

R₃ is C₁-C₁₈ alkyl;

R₅ is NHR₆;

R₆ is H or C₁-C₈ alkyl; and

R₇ is selected from the group consisting of hydrogen, C₁-C₁₈ alkyl,C₂-C₁₈ alkenyl, (C₀-C₄ alkyl)CONH₂, (C₀-C₄ alkyl)COOH, (C₀-C₄ alkyl)NH₂,(C₀-C₄ alkyl)OH, and halo.

In a further embodiment the dipeptide prodrug element has the structureof Formula X, wherein

R₁ and R₂ are independently C₁-C₁₈ alkyl or (C₀-C₄ alkyl)(C₆-C₁₀aryl)R₇; or R₁ and R₂ are linked through —(CH₂)_(p), wherein p is 2-9;

R₃ is C₁-C₁₈ alkyl;

R₄ and R₈ are each hydrogen;

R₅ is NH₂; and

R₇ is selected from the group consisting of hydrogen, C₁-C₁₈ alkyl,C₂-C₁₈ alkenyl, (C₀-C₄ alkyl)CONH₂, (C₀-C₄ alkyl)COOH, (C₀-C₄ alkyl)NH₂,(C₀-C₄ alkyl)OH, and halo.

In a further embodiment the dipeptide prodrug element has the structureof Formula X, wherein

R₁ and R₂ are independently selected from the group consisting ofhydrogen, C₁-C₁₈ alkyl, (C₁-C₁₈ alkyl)OH, (C₁-C₄ alkyl)NH₂, and (C₀-C₄alkyl)(C₆-C₁₀ aryl)R₇, or R₁ and R₂ are linked through (CH₂)_(p),wherein p is 2-9;

R₃ is C₁-C₁₈ alkyl or R₃ and R₄ together with the atoms to which theyare attached form a 4-12 heterocyclic ring;

R₄ and R₈ are independently selected from the group consisting ofhydrogen, C₁-C₈ alkyl and (C₀-C₄ alkyl)(C₆-C₁₀ aryl)R₇;

R₅ is NH₂; and

R₇ is selected from the group consisting of H, C₁-C₁₈ alkyl, C₂-C₁₈alkenyl, (C₀-C₄ alkyl)CONH₂, (C₀-C₄ alkyl)COOH, (C₀-C₄ alkyl)NH₂, (C₀-C₄alkyl)OH, and halo, with the proviso that both R₁ and R₂ are nothydrogen and provided that at least one of R₄ or R₈ is hydrogen.

In another embodiment the dipeptide prodrug element has the structure ofFormula X, wherein

R₁ and R₂ are independently selected from the group consisting ofhydrogen, C₁-C₈ alkyl and (C₁-C₄ alkyl)NH₂, or R₁ and R₂ are linkedthrough (CH₂)_(p), wherein p is 2-9;

R₃ is C₁-C₈ alkyl or R₃ and R₄ together with the atoms to which they areattached form a 4-6 heterocyclic ring;

R₄ is selected from the group consisting of hydrogen and C₁-C₈ alkyl; R₈is hydrogen; and

R₅ is NH₂, with the proviso that both R₁ and R₂ are not hydrogen.

In a further embodiment the dipeptide prodrug element has the structureof Formula X, wherein

R₁ and R₂ are independently selected from the group consisting ofhydrogen, C₁-C₈ alkyl and (C₁-C₄ alkyl)NH₂;

R₃ is C₁-C₆ alkyl;

R₄ and R₈ are each hydrogen; and

R₅ is NH₂, with the proviso that both R₁ and R₂ are not hydrogen.

In another embodiment the dipeptide prodrug element has the structure ofFormula X, wherein

R₁ and R₂ are independently selected from the group consisting ofhydrogen and C₁-C₈ alkyl, (C₁-C₄ alkyl)NH₂, or R₁ and R₂ are linkedthrough (CH₂)_(p), wherein p is 2-9;

R₃ is C₁-C₈ alkyl;

R₄ is (C₀-C₄ alkyl)(C₆-C₁₀ aryl)R₇;

R₅ is NH₂;

R₇ is selected from the group consisting of hydrogen, C₁-C₈ alkyl and(C₀-C₄ alkyl)OH; and

R₈ is hydrogen, with the proviso that both R₁ and R₂ are not hydrogen.

In another embodiment the dipeptide prodrug element has the structure ofFormula X, wherein

R₁ is selected from the group consisting of hydrogen, C₁-C₈ alkyl and(C₀-C₄ alkyl)(C₆-C₁₀ aryl)R₇;

R₂ is hydrogen;

R₃ is C₁-C₁₈ alkyl;

R₄ and R₈ are each hydrogen;

R₅ is NHR₆ or OH;

R₆ is H, C₁-C₈ alkyl, or R₆ and R₁ together with the atoms to which theyare attached form a 4, 5 or 6 member heterocyclic ring; and

R₇ is selected from the group consisting of hydrogen, C₁-C₁₈ alkyl,C₂-C₁₈ alkenyl, (C₀-C₄ alkyl)CONH₂, (C₀-C₄ alkyl)COOH, (C₀-C₄ alkyl)NH₂,(C₀-C₄ alkyl)OH, and halo, with the proviso that, if R₁ is alkyl or(C₀-C₄ alkyl)(C₆-C₁₀ aryl)R₇, then R₁ and R₆ together with the atoms towhich they are attached form a 4-11 heterocyclic ring.

In one embodiment a single chain insulin analog is provided comprisingan A chain and a B chain wherein the carboxy terminus of the B chain islinked to the amino terminus of said A chain via a linking moiety. Inone embodiment a single chain insulin analog is provided that comprisesthe structure: IB-LM-IA, wherein IB comprises the sequenceR₂₂—X₂₅LCGX₂₉X₃₀LVX₃₃X₃₄LYLVCGX₄₁X₄₂GFX₄₅ (SEQ ID NO: 21), LM is alinking moiety as disclosed herein that covalently links IB to IA, andIA comprises the sequence GIVX₄X₅CCX₈X₉X₁₀CX₁₂LX₁₄X₁₅LX₁₇X₁₈X₁₉CX₂₁—R₁₃(SEQ ID NO: 22), wherein the amino acid at the designation X₄₅ isphenylalanine or tyrosine that is directly bound to the linking moiety,LM. In one embodiment a single chain insulin analog is provided thatcomprises the structure: IB-LM-IA, wherein IB comprises the sequenceJ-R₂₃—R₂₂—X₂₅LCGX₂₉X₃₀LVX₃₃X₃₄LX₃₆LVCGX₄₁X₄₂GFX₄₅ (SEQ ID NO: 20) or asequence that differs from SEQ ID NO: 20 by 1 to 3 amino acidmodifications selected from positions 5, 6, 9, 10, 16, 18, 19 and 21 ofSEQ ID NO: 20, LM is a linking moiety as disclosed herein and IAcomprises the sequence GIVX₄X₅CCX₈X₉X₁₀CX₁₂LX₁₄X₁₅LX₁₇X₁₈X₁₉CX₂₁—R₁₃(SEQ ID NO: 22) or a sequence that differs from SEQ ID NO: 19 by 1 to 3amino acid modifications selected from positions 5, 8, 9, 10, 14, 15,17, 18 and 21 of SEQ ID NO: 20, wherein

J is H or a dipeptide element comprising the general structure of U-B,wherein U is an amino acid or a hydroxyl acid and B is an N-alkylatedamino acid linked through an amide bond;

X₄ is glutamic acid or aspartic acid;

X₅ is glutamine or glutamic acid

X₈ is histidine, threonine or phenylalanine;

X₉ is serine, arginine, ornithine, lysine or alanine;

X₁₀ is isoleucine or serine;

X₁₂ is serine or aspartic acid

X₁₄ is tyrosine, arginine, ornithine, lysine or alanine;

X₁₅ is glutamine, glutamic acid, arginine, alanine, ornithine, lysine orleucine;

X₁₇ is glutamic acid or glutamine;

X₁₈ is methionine, asparagine, glutamine, aspartic acid, glutamic acidor threonine;

X₁₉ is an amino acid of the general structure:

-   -   wherein X is selected from the group consisting of OH or NHR₁₀,        wherein R₁₀ is H or a dipeptide element comprising the general        structure U-B, wherein U is an amino acid or a hydroxyl acid and        B is an N-alkylated amino acid;

X₂₁ is selected from the group consisting of alanine, glycine, serine,valine, threonine, isoleucine, leucine, glutamine, glutamic acid,asparagine, aspartic acid, histidine, tryptophan, tyrosine, andmethionine;

X₂₅ is histidine or threonine;

X₂₉ is selected from the group consisting of alanine, glycine andserine;

X₃₀ is selected from the group consisting of histidine, aspartic acid,glutamic acid, homocysteic acid and cysteic acid;

X₃₃ is selected from the group consisting of aspartic acid, glutamineand glutamic acid;

X₃₄ is selected from the group consisting of alanine and threonine;

X₃₆ is an amino acid of the general structure

-   -   wherein X₁₂ is selected from the group consisting of OH and        NHR₁₁, wherein R₁₁ is a dipeptide element comprising the general        structure U-B;

X₄₁ is selected from the group consisting of glutamic acid, asparticacid or asparagine;

X₄₂ is selected from the group consisting of alanine, ornithine, lysineand arginine;

X₄₅ is an amino acid of the general structure

-   -   wherein X₁₃ is selected from the group consisting of H, OH and        NHR₁₂, wherein R₁₂ is H or dipeptide element comprising the        general structure U-B;

R₂₂ is selected from the group consisting of AYRPSE (SEQ ID NO: 14),FVNQ (SEQ ID NO: 12), PGPE (SEQ ID NO: 11), a tripeptideglycine-proline-glutamic acid, a tripeptide valine-asparagine-glutamine,a dipeptide proline-glutamic acid, a dipeptide asparagine-glutamine,glutamine, glutamic acid and an N-terminal amine;

R₂₃ is a bond or an amino sequence comprising 1 to 6 charged aminoacids; and

R₁₃ is COOH or CONH₂. In one embodiment, only one of J, R₁₀, R₁₁ and R₁₂is U-B. In one embodiment X₁₂ is OH and X₁₃ is H or OH and J and/or X isU-B. In one embodiment X₁₂ is OH and X₁₃ is OH, R₂₃ is a bond, J is Hand X is U-B. In a further embodiment X₈, X₂₅ and X₃₀ are eachhistidine. In another embodiment the single chain insulin analog peptidecomprises an A chain peptide sequence of SEQ ID NO: 19 and a B chainpeptide sequence of SEQ ID NO: 20. In one embodiment R₂₃ is a bond orcomprises an amino sequence X₆₀(X₆₁X₆₂)_(d)X₆₃K (SEQ ID NO: 192) whereinX₆₀ is selected from the group consisting of glycine, glutamic acid andaspartic acid, X₆₁ and X₆₂ are independently selected from the groupconsisting of glutamic acid and aspartic acid, X₆₃ is selected from thegroup consisting of arginine aspartic acid and glutamic acid and d is aninteger ranging from 1-3.

In one embodiment a single chain insulin analog is provided wherein IBcomprises the sequence J-R₂₃R₂₂—X₂₅LCGX₂₉X₃₀LVX₃₃X₃₄LX₃₆LVCGX₄₁X₄₂GFX₄₅(SEQ ID NO: 20) or J-R₂₃R₂₂—X₂₅LCGX₂₉X₃₀LVEALYLVCG ERGFF (SEQ ID NO: 53)wherein the carboxy terminus of the B25 amino acid of IB is directlylinked to a first end of the linking moiety (LM) and a second end of thelinking moiety is directly linked to the amino terminus of the A1 aminoacid of the IA chain, further wherein, in one embodiment, the linkingmoiety does not comprise the sequence YTPKT (SEQ ID NO: 16) or FNKPT(SEQ ID NO: 76). In one embodiment a single chain insulin analog isprovided wherein IB comprises the sequenceJ-R₂₃R₂₂—X₂₅LCGX₂₉X₃₀LVX₃₃X₃₄LX₃₆LVCGX₄₁X₄₂GFX₄₅ (SEQ ID NO: 20),J-R₂₃R₂₂—X₂₅LCGX₂₉X₃₀LVEALYLVCG ERGFF (SEQ ID NO: 53),R₂₃R₂₂—X₂₅LCGX₂₉X₃₀LVX₃₃X₃₄LX₃₆LVCGX₄₁X₄₂GFX₄₅ (SEQ ID NO: 20) orR₂₃R₂₂—X₂₅LCGX₂₉X₃₀LVEALYLVCG ERGFF (SEQ ID NO: 53) and the linkingmoiety is a peptide linker of the general structure(Y₁)_(k)—X_(m)AX₅₃X₅₄X₅₅X₅₆X₅₇RR(Y₂)_(n) (SEQ ID NO: 28),(Y₁)_(k)—X₅₁AX₅₃X₅₄X₅₅X₅₆RR(Y₂)_(n) (SEQ ID NO: 23),(Y₁)_(k)-GYGSSSX₅₇X₅₈(Y₂)_(n) (SEQ ID NO: 85),(Y₁)_(k)-GAGSSSX₅₇X₅₈—(Y₂)_(n) (SEQ ID NO: 163), (Y₁)_(k)-GYGSSSX₅₇R(SEQ ID NO: 51) or (Y₁)_(k)—X₅₁X₅₂GSSSX₅₇X₅₈—(Y₂)_(n) (SEQ ID NO: 29)wherein Y₁ is selected from the group X₄₆, and X₄₆X₄₇, with the provisothat when k is 0, then, the linking peptide does not comprise thesequence YTPKT (SEQ ID NO: 16) or FNKPT (SEQ ID NO: 76).

In one embodiment a single chain insulin analog is provided thatcomprises the structure: IB-LM-IA, wherein IB comprises the sequenceJ-R₂₃R₂₂—X₂₅LCGX₂₉X₃₀LVEALYLVCG ERGFF (SEQ ID NO: 53), LM is a linkingmoiety as disclosed herein and IA comprises the sequenceGIVEQCCX₈SICSLYQLX₁₇NX₁₉CX₂₃ (SEQ ID NO: 52) wherein

X₈ is selected from the group consisting of threonine and histidine;

X₁₇ is glutamic acid or glutamine;

X₁₉ is an amino acid of the general structure:

-   -   wherein X is selected from the group consisting of OH or NHR₁₀,        wherein R₁₀ is H or a dipeptide element comprising the general        structure U-B, wherein U is an amino acid or a hydroxyl acid and        B is an N-alkylated amino acid;

X₂₃ is asparagine or glycine;

X₂₅ is selected from the group consisting of histidine and threonine;

X₂₉ is selected from the group consisting of alanine, glycine andserine;

X₃₀ is selected from the group consisting of histidine, aspartic acid,glutamic acid, homocysteic acid and cysteic acid;

R₂₂ is selected from the group consisting of FVNQ (SEQ ID NO: 12), atripeptide valine-asparagine-glutamine, a dipeptideasparagine-glutamine, glutamine, glutamic acid and an N-terminal amine;and

R₂₃ is a bond or an amino sequence comprising 1 to 6 charged aminoacids.

In a further embodiment the B chain comprises the sequenceX₂₂VNQX₂₅LCGX₂₉X₃₀LVEALYLVCGERGFFYT-Z₁-B₁ (SEQ ID NO: 54) wherein

X₂₂ is selected from the group consisting of phenylalanine anddesamino-phenylalanine;

X₂₅ is selected from the group consisting of histidine and threonine;

X₂₉ is selected from the group consisting of alanine, glycine andserine;

X₃₀ is selected from the group consisting of histidine, aspartic acid,glutamic acid, homocysteic acid and cysteic acid;

Z₁ is a dipeptide selected from the group consisting ofaspartate-lysine, lysine-proline, and proline-lysine; and

B₁ is selected from the group consisting of threonine, alanine or athreonine-arginine-arginine tripeptide.

In accordance with one embodiment a single chain insulin analog isprovided that comprises the structure: IB-LM-IA, wherein IB comprisesthe sequence X₂₅LCGX₂₉X₃₀LVEALYLVCG ERGFF (SEQ ID NO: 53), LM is alinking moiety as disclosed herein that covalently links IB to IA, andIA comprises the sequence GIVEQCCX₈SICSLYQLENX₁₉CX₂₁ (SEQ ID NO: 55),wherein the C-terminal phenylalanine residue of SEQ ID NO: 53 isdirectly covalently bound to the linking moiety, LM, in the absence ofany intervening amino acids.

In one embodiment a single chain insulin analog is provided thatcomprises the structure: IB-LM-IA, wherein IB comprises the sequenceJ-R₂₃—R₂₂—X₂₅LCGX₂₉X₃₀LVX₃₃X₃₄LYLVCGDX₄₂GFX₄₅ (SEQ ID NO: 58), LM is alinking moiety as disclosed herein and IA comprises the sequenceGIVX₄ECCX₈X₉SCDLX₁₄X₁₅LX₁₇X₁₈X₁₉CX₂₁—R₁₃ (SEQ ID NO: 19) wherein

J is H or a dipeptide element comprising the general structure of U-B,wherein U is an amino acid or a hydroxyl acid and B is an N-alkylatedamino acid linked through an amide bond;

X₄ is aspartic acid or glutamic acid;

X₈ is histidine or phenylalanine;

X₉ and X₁₄ are independently selected from arginine, ornithine, lysineor alanine;

X₁₅ is arginine, lysine, ornithine or leucine;

X₁₇ is glutamic acid or glutamine;

X₁₈ is methionine, asparagine or threonine;

X₁₉ is an amino acid of the general structure:

-   -   wherein X is selected from the group consisting of OH or NHR₁₀,        wherein R₁₀ is H or a dipeptide element comprising the general        structure U-B, wherein U is an amino acid or a hydroxyl acid and        B is an N-alkylated amino acid;

X₂₁ is alanine, glycine or asparagine;

R₂₂ is selected from the group consisting of a covalent bond, AYRPSE(SEQ ID NO: 14), a glycine-proline-glutamic acid tripeptide, aproline-glutamic acid dipeptide and glutamic acid;

R₂₃ is a bond or an amino sequence comprising 1 to 6 charged aminoacids;

X₂₅ is selected from the group consisting of histidine and threonine;

X₂₉ is selected from the group consisting of alanine, glycine andserine;

X₃₀ is selected from the group consisting of histidine, aspartic acid,glutamic acid, homocysteic acid and cysteic acid;

X₃₃ is selected from the group consisting of aspartic acid and glutamicacid;

X₃₄ is selected from the group consisting of alanine and threonine;

X₄₂ is selected from the group consisting of alanine, lysine ornithineand arginine;

X₄₅ is an amino acid of the general structure

-   -   wherein X₁₃ is selected from the group consisting of H, OH and        NHR₁₂, wherein R₁₂ is a dipeptide element comprising the general        structure U-B;

and

R₁₃ is COOH or CONH₂, with the proviso that one and only one of J, X,and X₁₃, comprises U-B. In one embodiment J is H, and X₁₃ is OH, and Xis NH-U-B.

In one embodiment U and B of the dipeptide prodrug element U-B areselected to inhibit enzymatic cleavage of the U-B dipeptide from aninsulin peptide by enzymes found in mammalian serum. In one embodiment Uand/or B are selected such that the cleavage half-life of U-B from theinsulin peptide, in PBS under physiological conditions, is not more thantwo fold the cleavage half-life of U-B from the insulin peptide in asolution comprising a DPP-IV protease (i.e., cleavage of U-B from theinsulin prodrug does not occur at a rate more than 2× faster in thepresence of DPP-IV protease and physiological conditions relative toidentical conditions in the absence of the enzyme). In one embodiment U,B, or the amino acid of the insulin peptide to which U-B is linked is anon-coded amino acid. In one embodiment U and/or B is an amino acid inthe D stereoisomer configuration. In some exemplary embodiments, U is anamino acid in the D stereoisomer configuration and B is an amino acid inthe L stereoisomer configuration. In some exemplary embodiments, U is anamino acid in the L stereoisomer configuration and B is an amino acid inthe D stereoisomer configuration. In some exemplary embodiments, U is anamino acid in the D stereoisomer configuration and B is an amino acid inthe D stereoisomer configuration. In one embodiment U-B is a dipeptidecomprising the structure of Formula X as defined herein. In oneembodiment B is an N-alkylated amino acid but is not proline.

In accordance with one embodiment a single chain insulin agonistpolypeptide comprising a B chain and A chain of human insulin, oranalogs or derivative thereof is provided, wherein the carboxy terminusof the B chain is linked to the amino terminus of the A chain via alinking moiety. In one embodiment the linking moiety comprises apolyethylene glycol of 6-16 monomer units, an amino acid sequence of atleast 8 amino acids and no more than 12 amino acid in length (or apeptidomimetic thereof), or a combination of said polyethylene glycoland amino acid sequence. More particularly, in one embodiment thelinking moiety comprises an amino acid sequence, X₅₁X₅₂X₅₃X₅₄X₅₅X₅₆RR(SEQ ID NO: 10), wherein X₅₁ is selected from the group consisting ofglycine, alanine, valine, leucine, isoleucine, proline, phenylalanineand methionine; X₅₂ is any amino acid other than tyrosine, including forexample any non-aromatic amino acid; and X₃ through X₆ are eachindependently any amino acid.

In one embodiment the single chain insulin analog comprises thestructure IB-LM-IA, wherein IB comprises the sequence

J-R₂₃R₂₂—X₂₅LCGX₂₉X₃₀LVX₃₃X₃₄LX₃₆LVCGX₄₁X₄₂GFX₄₅ SEQ ID NO: 20);

LM is a linking moiety is selected from the group consisting of(Y₁)_(k)—X₅₁X₅₂X₅₃X₅₄X₅₅X₅₆X₅₇X₅₈(Y₂)_(n) (SEQ ID NO: 9),(Y₁)_(k)—X₅₁AX₅₃X₅₄X₅₅X₅₆RR(Y₂)_(n) (SEQ ID NO: 23),(Y₁)_(k)-GYGSSSX₅₇X₅₈(Y₂)_(n) (SEQ ID NO: 85) and

and

IA comprises the sequence GIVX₄X₅CCX₈X₉X₁₀CX₁₂LX₁₄X₁₅LX₁₇X₁₈X₁₉CX₂₁—R₁₃(SEQ ID NO: 22) wherein

J is H or a dipeptide comprising the general structure of U-B, wherein Uis an amino acid or a hydroxyl acid and B is an N-alkylated amino acidlinked through an amide bond;

Y₁ is selected from the group X₄₆, X₄₆X₄₇, X₄₆X₄₇X₄₈, X₄₆X₄₇X₄₈X₄₉ (SEQID NO: 24) and X₄₆X₄₇X₄₈X₄₉X₅₀ (SEQ ID NO: 13); and

Y₂ is selected from the group X₇₀, X₇₀X₇₁, X₇₀X₇₁X₇₂ and X₇₀X₇₁X₇₂X₇₃(SEQ ID NO: 15) wherein

n is 0 or 1;

k is 0 or 1;

m is an integer ranging from 5 to 15;

X₄ is glutamic acid or aspartic acid;

X₅ is glutamine or glutamic acid

X₈ is histidine, threonine or phenylalanine;

X₉ is serine, arginine, lysine, ornithine or alanine;

X₁₀ is isoleucine or serine;

X₁₂ is serine or aspartic acid

X₁₄ is tyrosine, arginine, lysine, ornithine or alanine;

X₁₅ is glutamine, glutamic acid, arginine, alanine, lysine, ornithine orleucine;

X₁₇ is glutamic acid or glutamine;

X₁₈ is methionine, asparagine, glutamine, aspartic acid, glutamic acidor threonine;

X₁₉ is an amino acid of the general structure:

-   -   wherein X is selected from the group consisting of OH, OCH₃ or        NHR₁₀, wherein R₁₀ is H or a dipeptide element comprising the        general structure U-B, wherein U is an amino acid or a hydroxyl        acid and B is an N-alkylated amino acid;

X₂₁ is selected from the group consisting of alanine, glycine, serine,valine, threonine, isoleucine, leucine, glutamine, glutamic acid,asparagine, aspartic acid, histidine, tryptophan, tyrosine, andmethionine;

X₂₅ is histidine or threonine;

X₂₉ is selected from the group consisting of alanine, glycine andserine;

X₃₀ is selected from the group consisting of histidine, aspartic acid,glutamic acid, homocysteic acid and cysteic acid;

X₃₃ is selected from the group consisting of aspartic acid, glutamineand glutamic acid;

X₃₄ is selected from the group consisting of alanine and threonine;

X₃₆ is an amino acid of the general structure

-   -   wherein X₁₂ is selected from the group consisting of OH and        NHR₁₁, wherein R₁₁ is a dipeptide element comprising the general        structure U-B;

X₄₁ is selected from the group consisting of glutamic acid, asparticacid or asparagine;

X₄₂ is selected from the group consisting of alanine, lysine, ornithineand arginine;

X₄₅ is an amino acid of the general structure

-   -   wherein X₁₃ is selected from the group consisting of H, OH and        NHR₁₂, wherein R₁₂ is H or dipeptide element comprising the        general structure U-B;

X₅₁ is selected from the group consisting of glycine, alanine, valine,leucine, isoleucine, proline, phenylalanine and methionine;

X₅₂ is any amino acid other than tyrosine;

X₄₆ through X₅₆ and X₇₀ through X₇₃ are each independently any aminoacid or amino acid analog or derivative thereof;

X₅₇ and X₅₈ are independently selected from the group arginine,ornithine and lysine.

R₂₂ is selected from the group consisting of AYRPSE (SEQ ID NO: 14),FVNQ (SEQ ID NO: 12), PGPE (SEQ ID NO: 11), a tripeptideglycine-proline-glutamic acid, a tripeptide valine-asparagine-glutamine,a dipeptide proline-glutamic acid, a dipeptide asparagine-glutamine,glutamine, glutamic acid and a bond;

R₂₃ is a bond or an amino sequence comprising 1 to 6 charged aminoacids; and

R₁₃ is COOH or CONH₂, with the proviso that U, B, or the amino acid ofthe single chain insulin agonist to which U-B is linked is a non-codedamino acid. In one embodiment at least one of n or k is 1.

In one embodiment a single chain insulin analog is provided comprisingthe structure IB-LM-IA, wherein IB comprises the sequence

R₂₃R₂₂—X₂₅LCGX₂₉X₃₀LVX₃₃X₃₄LYLVCGX₄₁X₄₂GFX₄₅ (SEQ ID NO: 21);

LM is a linking moiety is selected from the group consisting of(Y₁)_(k)—X₅₁X₅₂X₅₃X₅₄X₅₅X₅₆X₅₇X₅₈(Y₂)_(n) (SEQ ID NO: 9),(Y₁)_(k)—X₅₁AX₅₃X₅₄X₅₅X₅₆RR(Y₂)_(n) (SEQ ID NO: 23),(Y₁)_(k)-GYGSSSX₅₇X₅₈(Y₂)_(n) (SEQ ID NO: 85) and

and

IA comprises the sequence GIVX₄X₅CCX₈X₉X₁₀CX₁₂LX₁₄X₁₅LX₁₇X₁₈X₁₉CX₂₁—R₁₃(SEQ ID NO: 22) wherein

Y₁ is selected from the group X₄₆, X₄₆X₄₇, X₄₆X₄₇X₄₈, X₄₆X₄₇X₄₈X₄₉ (SEQID NO: 24) and X₄₆X₄₇X₄₈X₄₉X₅₀ (SEQ ID NO: 13); and

Y₂ is selected from the group X₇₀, X₇₀X₇₁, X₇₀X₇₁X₇₂ and X₇₀X₇₁X₇₂X₇₃(SEQ ID NO: 15) wherein

n is 0 or 1;

k is 0 or 1;

m is an integer ranging from 5 to 15;

X₄ is glutamic acid or aspartic acid;

X₅ is glutamine or glutamic acid

X₈ is histidine, threonine or phenylalanine;

X₉ is serine, arginine, lysine, ornithine or alanine;

X₁₀ is isoleucine or serine;

X₁₂ is serine or aspartic acid

X₁₄ is tyrosine, arginine, lysine, ornithine or alanine;

X₁₅ is glutamine, glutamic acid, arginine, alanine, lysine, ornithine orleucine;

X₁₇ is glutamic acid or glutamine;

X₁₈ is methionine, asparagine, glutamine, aspartic acid, glutamic acidor threonine;

X₁₉ is an amino acid of the general structure:

-   -   wherein X is selected from the group consisting of OH or NHR₁₀,        wherein R₁₀ is H or a dipeptide element comprising the general        structure U-B, wherein U is an amino acid or a hydroxyl acid and        B is an N-alkylated amino acid;

X₂₁ is selected from the group consisting of alanine, glycine, serine,valine, threonine, isoleucine, leucine, glutamine, glutamic acid,asparagine, aspartic acid, histidine, tryptophan, tyrosine, andmethionine;

X₂₅ is histidine or threonine;

X₂₉ is selected from the group consisting of alanine, glycine andserine;

X₃₀ is selected from the group consisting of histidine, aspartic acid,glutamic acid, homocysteic acid and cysteic acid;

X₃₃ is selected from the group consisting of aspartic acid, glutamineand glutamic acid;

X₃₄ is selected from the group consisting of alanine and threonine;

X₄₅ is tyrosine or phenylalanine;

X₅₁ is selected from the group consisting of glycine, alanine, valine,leucine, isoleucine, proline, phenylalanine and methionine;

X₅₂ is any amino acid other than tyrosine;

X₄₆ through X₅₆ and X₇₀ through X₇₃ are each independently any aminoacid or amino acid analog or derivative thereof;

X₅₇ and X₅₈ are independently selected from the group arginine,ornithine and lysine;

R₂₂ is selected from the group consisting of AYRPSE (SEQ ID NO: 14),FVNQ (SEQ ID NO: 12), PGPE (SEQ ID NO: 11), a tripeptideglycine-proline-glutamic acid, a tripeptide valine-asparagine-glutamine,a dipeptide proline-glutamic acid, a dipeptide asparagine-glutamine,glutamine, glutamic acid and a bond;

R₂₃ is a H or an amino sequence comprising 1 to 6 charged amino acids;and

R₁₃ is COOH or CONH₂. In one embodiment at least one of n or k is 1. Inone embodiment both n and k are 1. In a further embodiment Y₂ isselected from the group consisting of A, AP, APQ and APQT and Y₁ isselected from the group consisting of F, Y, FN, YT, FNK, YTP, FNPK (SEQID NO: 79), FNKP (SEQ ID NO: 77), YTPK (SEQ ID NO: 78), YTPKT (SEQ IDNO: 16), YTKPT (SEQ ID NO: 80), FNKPT (SEQ ID NO: 76) and FNPKT (SEQ IDNO: 81). In one embodiment R₂₃ is a bond and R₂₂ is selected from thegroup consisting of AYRPSE (SEQ ID NO: 14), PGPE (SEQ ID NO: 11), atripeptide glycine-proline-glutamic acid, a dipeptide proline-glutamicacid, glutamine, glutamic acid and H.

In one embodiment R₂₃ is H or an amino sequence of 4 to 7 amino acidswherein the N-terminal amino acid is selected from the group consistingof glycine, glutamic acid and aspartic acid, the C-terminal amino acidis a lysine and the other amino acids of the sequence are independentlyselected from the group consisting of glutamic acid and aspartic acidand R₂₂ is selected from the group consisting of AYRPSE (SEQ ID NO: 14),PGPE (SEQ ID NO: 11), a tripeptide glycine-proline-glutamic acid, adipeptide proline-glutamic acid, glutamine, glutamic acid and H.

In one embodiment U-B comprises the structure of Formula X:

wherein

-   -   R₁, R₂, R₄ and R₈ are independently selected from the group        consisting of H, C₁-C₁₈ alkyl, C₂-C₁₈ alkenyl, (C₁-C₁₈ alkyl)OH,        (C₁-C₁₈ alkyl)SH, (C₂-C₃ alkyl)SCH₃, (C₁-C₄ alkyl)CONH₂, (C₁-C₄        alkyl)COOH, (C₁-C₄ alkyl)NH₂, (C₁-C₄ alkyl)NHC(NH₂ ⁺)NH₂, (C₀-C₄        alkyl)(C₃-C₆ cycloalkyl), (C₀-C₄ alkyl)(C₂-C₅ heterocyclic),        (C₀-C₄ alkyl)(C₆-C₁₀ aryl)R₇, (C₁-C₄ alkyl)(C₃-C₉ heteroaryl),        and C₁-C₁₂ alkyl(W₁)C₁-C₁₂ alkyl, wherein W₁ is a heteroatom        selected from the group consisting of N, S and O, or    -   R₁ and R₂ together with the atoms to which they are attached        form a C₃-C₁₂ cycloalkyl or aryl; or    -   R₄ and R₈ together with the atoms to which they are attached        form a C₃-C₆ cycloalkyl;    -   R₃ is selected from the group consisting of C₁-C₁₈ alkyl,        (C₁-C₁₈ alkyl)OH, (C₁-C₁₈ alkyl)NH₂, (C₁-C₁₈ alkyl)SH, (C₀-C₄        alkyl)(C₃-C₆)cycloalkyl, (C₀-C₄ alkyl)(C₂-C₅ heterocyclic),        (C₀-C₄ alkyl)(C₆-C₁₀ aryl)R₇, and (C₁-C₄ alkyl)(C₃-C₉        heteroaryl) or R₄ and R₃ together with the atoms to which they        are attached form a 4, 5 or 6 member heterocyclic ring;    -   R₅ is NHR₆ or OH;    -   R₆ is H, C₁-C₈ alkyl or R₆ and R₁ together with the atoms to        which they are attached form a 4, 5 or 6 member heterocyclic        ring; and    -   R₇ is selected from the group consisting of H, OH, C₁-C₁₈ alkyl,        C₂-C₁₈ alkenyl, (C₀-C₄ alkyl)CONH₂, (C₀-C₄ alkyl)COOH, (C₀-C₄        alkyl)NH₂, (C₀-C₄ alkyl)OH, and halo.

In another embodiment a single chain insulin analog is providedcomprising the structure IB-LM-IA, wherein IB comprises the sequence

J-R₂₂—X₂₅LCGX₂₉X₃₀LVX₃₃X₃₄LYLVCGX₄₁X₄₂GFX₄₅ (SEQ ID NO: 21);

LM comprises the sequence (Y₁)_(k)-GYGSSSGX₅₇R(Y₂)_(n) (SEQ ID NO: 91)or (Y₁)_(k)—X₅₁AX₅₃X₅₄X₅₅X₅₆X₅₇R(Y₂)_(n) (SEQ ID NO: 28); and

IA comprises the sequence GIVX₄X₅CCX₈X₉X₁₀CX₁₂LX₁₄X₁₅LX₁₇X₁₈X₁₉CX₂₁—R₁₃(SEQ ID NO: 22) wherein

J is H or a dipeptide comprising the general structure of U-B, wherein Uis an amino acid or a hydroxyl acid and B is an N-alkylated amino acidlinked through an amide bond;

Y₁ is selected from the group X₄₆, X₄₆X₄₇, X₄₆X₄₇X₄₈, X₄₆X₄₇X₄₈X₄₉ (SEQID NO: 24) and X₄₆X₄₇X₄₈X₄₉X₅₀ (SEQ ID NO: 13); and

Y₂ is selected from the group X₇₀, X₇₀X₇₁, X₇₀X₇₁X₇₂ and X₇₀X₇₁X₇₂X₇₃(SEQ ID NO: 15) wherein

n is 0 or 1;

k is 0 or 1;

X₄ is glutamic acid or aspartic acid;

X₅ is glutamine or glutamic acid

X₈ is histidine, threonine or phenylalanine;

X₉ is serine, arginine, lysine, ornithine or alanine;

X₁₀ is isoleucine or serine;

X₁₂ is serine or aspartic acid

X₁₄ is tyrosine, arginine, lysine, ornithine or alanine;

X₁₅ is glutamine, glutamic acid, arginine, alanine, lysine, ornithine orleucine;

X₁₇ is glutamic acid or glutamine;

X₁₈ is methionine, asparagine, glutamine, aspartic acid, glutamic acidor threonine;

X₁₉ is an amino acid of the general structure:

-   -   wherein X is selected from the group consisting of OH or NHR₁₀,        wherein R₁₀ is H or a dipeptide element comprising the general        structure U-B, wherein U is an amino acid or a hydroxyl acid and        B is an N-alkylated amino acid;

X₂₁ is selected from the group consisting of alanine, glycine, serine,valine, threonine, isoleucine, leucine, glutamine, glutamic acid,asparagine, aspartic acid, histidine, tryptophan, tyrosine, andmethionine;

X₂₅ is histidine or threonine;

X₂₉ is selected from the group consisting of alanine, glycine andserine;

X₃₀ is selected from the group consisting of histidine, aspartic acid,glutamic acid, homocysteic acid and cysteic acid;

X₃₃ is selected from the group consisting of aspartic acid, glutamineand glutamic acid;

X₃₄ is selected from the group consisting of alanine and threonine;

X₄₁ is selected from the group consisting of glutamic acid, asparticacid or asparagine;

X₄₂ is selected from the group consisting of alanine, lysine, ornithineand arginine;

X₄₅ is an amino acid of the general structure

-   -   wherein X₁₃ is selected from the group consisting of H, OH and        NHR₁₂, wherein R₁₂ is H or dipeptide element comprising the        general structure U-B;

X₅₁ is selected from the group consisting of glycine, alanine, valine,leucine, isoleucine, proline, phenylalanine and methionine;

X₅₂ is any amino acid other than tyrosine;

X₄₆ through X₅₆ and X₇₀ through X₇₃ are each independently any aminoacid or amino acid analog or derivative thereof; and

X₅₇ is arginine, lysine, ornithine or lysine;

R₂₂ is selected from the group consisting of AYRPSE (SEQ ID NO: 14),FVNQ (SEQ ID NO: 12), PGPE (SEQ ID NO: 11), a tripeptideglycine-proline-glutamic acid, a tripeptide valine-asparagine-glutamine,a dipeptide proline-glutamic acid, a dipeptide asparagine-glutamine,glutamine, glutamic acid and a bond;

R₂₃ is a bond or an amino sequence comprising 1 to 6 charged aminoacids; and

R₁₃ is COOH or CONH₂. In one embodiment k is 0, in another embodiment nis 0 and in one further embodiment both k and n are 0. In one embodimentthe linking moiety is GYGSSSRR (SEQ ID NO: 18) or GX₅₂GSSSRRAPQT (SEQ IDNO: 83), wherein X₅₂ is a non-aromatic amino acid.

In another embodiment the single chain insulin analogs comprise thestructure IB-LM-IA, wherein IB comprises sequence

J-R₂₃—R₂₂—X₂₅LCGX₂₉X₃₀LVX₃₃X₃₄LYLVCGX₄₁X₄₂GFX₄₅(SEQ ID NO: 21);

LM comprises the structure:

andIA comprises the sequence GIVX₄X₅CCX₈X₉X₁₀CX₁₂LX₁₄X₁₅LX₁₇X₁₈X₁₉CX₂₁—R₁₃(SEQ ID NO: 22) wherein

J is H or a dipeptide element comprising the general structure of U-B,wherein U is an amino acid or a hydroxyl acid and B is an N-alkylatedamino acid linked through an amide bond;

m is an integer ranging from 5 to 15;

X₄ is glutamic acid or aspartic acid;

X₅ is glutamine or glutamic acid

X₈ is histidine, threonine or phenylalanine;

X₉ is serine, arginine, lysine, ornithine or alanine;

X₁₀ is isoleucine or serine;

X₁₂ is serine or aspartic acid

X₁₄ is tyrosine, arginine, lysine, ornithine or alanine;

X₁₅ is glutamine, glutamic acid, arginine, alanine, lysine, ornithine orleucine;

X₁₇ is glutamic acid or glutamine;

X₁₈ is methionine, asparagine, glutamine, aspartic acid, glutamic acidor threonine;

X₁₉ is an amino acid of the general structure:

-   -   wherein X is selected from the group consisting of OH or NHR₁₀,        wherein R₁₀ is H or a dipeptide element comprising the general        structure U-B, wherein U is an amino acid or a hydroxyl acid and        B is an N-alkylated amino acid;

X₂₁ is selected from the group consisting of alanine, glycine, serine,valine, threonine, isoleucine, leucine, glutamine, glutamic acid,asparagine, aspartic acid, histidine, tryptophan, tyrosine, andmethionine;

X₂₅ is histidine or threonine;

X₂₉ is selected from the group consisting of alanine, glycine andserine;

X₃₀ is selected from the group consisting of histidine, aspartic acid,glutamic acid, homocysteic acid and cysteic acid;

X₃₃ is selected from the group consisting of aspartic acid, glutamineand glutamic acid;

X₃₄ is selected from the group consisting of alanine and threonine;

X₄₁ is selected from the group consisting of glutamic acid, asparticacid or asparagine;

X₄₂ is selected from the group consisting of alanine, lysine, ornithineand arginine;

X₄₅ is an amino acid of the general structure

-   -   wherein X₁₃ is selected from the group consisting of H, OH and        NHR₁₂, wherein R₁₂ is H or dipeptide element comprising the        general structure U-B;

X₅₁ is selected from the group consisting of glycine, alanine, valine,leucine, isoleucine, proline, phenylalanine and methionine;

X₅₂ is any amino acid other than tyrosine;

X₄₆ through X₅₆ and X₇₀ through X₇₃ are each independently any aminoacid or amino acid analog or derivative thereof;

R₂₂ is selected from the group consisting of AYRPSE (SEQ ID NO: 14),FVNQ (SEQ ID NO: 12), PGPE (SEQ ID NO: 11), a tripeptideglycine-proline-glutamic acid, a tripeptide valine-asparagine-glutamine,a dipeptide proline-glutamic acid, a dipeptide asparagine-glutamine,glutamine, glutamic acid and an N-terminal amine;

R₂₃ is a bond or X₆₀(X₆₁X₆₂)_(d)X₆₃K (SEQ ID NO: 192)

-   -   wherein    -   X₆₀ is selected from the group consisting of glycine, glutamic        acid and aspartic acid;    -   X₆₁ and X₆₂ are independently selected from the group consisting        of glutamic acid and aspartic acid;    -   X₆₃ is selected from the group consisting of arginine aspartic        acid and glutamic acid;    -   d is an integer ranging from 1-3; and

R₁₃ is COOH or CONH₂.

In one embodiment the single chain insulin analog comprises thestructure IB-LM-IA, wherein IB comprises sequence

J-R₂₃—R₂₂—X₂₅LCGX₂₉X₃₀LVX₃₃X₃₄LX₃₆LVCGX₄₁X₄₂GFX₄₅ SEQ ID NO: 20);

LM comprises the sequence (Y₁)_(k)—X₅₁AX₅₃X₅₄X₅₅X₅₆RR(Y₂)_(n) (SEQ IDNO: 23); and

IA comprises the sequence GIVX₄X₅CCX₈X₉X₁₀CX₁₂T-X₁₄X₁₅LX₁₇X₁₈X₁₉CX₂₁—R₁₃(SEQ ID NO: 22), wherein n is 1 and at least one of X₄₆, X₄₇, X₄₈, X₄₉,and/or X₅₀ is an amino acid selected from the group of formula I, II orIII. In one embodiment Y₁ is selected from the group consisting of F, Y,FN, YT, FNK, YTP, FNPK (SEQ ID NO: 79), FNKP (SEQ ID NO: 77), YTPK (SEQID NO: 78), YTPKT (SEQ ID NO: 16), YTKPT (SEQ ID NO: 80), FNKPT (SEQ IDNO: 76) and FNPKT (SEQ ID NO: 81). In another embodiment Y₁ is selectedfrom the group consisting of F, FN, FNK, FNPK (SEQ ID NO: 79), FNKPT(SEQ ID NO: 76) and FNPKT (SEQ ID NO: 81). In one embodiment Y₂ isselected from the group consisting of A, AP, APQ and APQT (SEQ ID NO:82). In a further embodiment n is 0. In one embodiment X₅₃, X₅₄, X₅₅ andX₅₆ are each independently selected from the group consisting ofglycine, alanine, serine, threonine and proline. In one embodimentwherein IB comprises sequenceJ-R₂₃—R₂₂—X₂₅LCGX₂₉X₃₀LVX₃₃X₃₄LX₃₆LVCGX₄₁X₄₂GFX₄₅ (SEQ ID NO: 20), LMcomprises a sequence selected from the group consisting ofFX₅₁AX₅₃X₅₄X₅₅X₅₆RR (SEQ ID NO: 94), FX₅₁AX₅₃X₅₄X₅₅X₅₆RRA (SEQ ID NO:95), FX₅₁AX₅₃X₅₄X₅₅X₅₆RRAP (SEQ ID NO: 96), FX₅₁AX₅₃X₅₄X₅₅X₅₆RRAPQ (SEQID NO: 97), FNX₅₁AX₅₃X₅₄X₅₅X₅₆RRAPQT (SEQ ID NO: 98),FNX₅₁AX₅₃X₅₄X₅₅X₅₆RR (SEQ ID NO: 99), FNX₅₁AX₅₃X₅₄X₅₅X₅₆RRA (SEQ ID NO:100), FNX₅₁AX₅₃X₅₄X₅₅X₅₆RRAP (SEQ ID NO: 101), FNX₅₁AX₅₃X₅₄X₅₅X₅₆RRAPQ(SEQ ID NO: 102), FNX₅₁AX₅₃X₅₄X₅₅X₅₆RRAPQT (SEQ ID NO: 103),FNKX₅₁AX₅₃X₅₄X₅₅X₅₆RR (SEQ ID NO: 104), FNKX₅₁AX₅₃X₅₄X₅₅X₅₆RRA (SEQ IDNO: 105), FNKX₅₁AX₅₃X₅₄X₅₅X₅₆RRAP (SEQ ID NO: 106),FNKX₅₁AX₅₃X₅₄X₅₅X₅₆RRAPQ (SEQ ID NO: 107), FNKX₅₁AX₅₃X₅₄X₅₅X₅₆RRAPQT(SEQ ID NO: 108), FNKPX₅₁AX₅₃X₅₄X₅₅X₅₆RR (SEQ ID NO: 109),FNKPX₅₁AX₅₃X₅₄X₅₅X₅₆RRA (SEQ ID NO: 110), FNKPX₅₁AX₅₃X₅₄X₅₅X₅₆RRAP (SEQID NO: 111), FNKPX₅₁AX₅₃X₅₄X₅₅X₅₆RRAPQ (SEQ ID NO: 112),FNKPX₅₁AX₅₃X₅₄X₅₅X₅₆RRAPQT (SEQ ID NO: 113), FNKPTX₅₁AX₅₃X₅₄X₅₅X₅₆RR(SEQ ID NO: 114), FNKPTX₅₁AX₅₃X₅₄X₅₅X₅₆RRA (SEQ ID NO: 115),FNKPTX₅₁AX₅₃X₅₄X₅₅X₅₆RRAP (SEQ ID NO: 116), FNKPTX₅₁AX₅₃X₅₄X₅₅X₅₆RRAPQ(SEQ ID NO: 117), FNKPTX₅₁AX₅₃X₅₄X₅₅X₅₆RRAPQT (SEQ ID NO: 118),FX₅₁X₅₂GSSSRR (SEQ ID NO: 119), FX₅₁X₅₂GSSSRRA (SEQ ID NO: 120),FX₅₁X₅₂GSSSRRAP (SEQ ID NO: 121), FX₅₁X₅₂GSSSRRAPQ (SEQ ID NO: 122),FNX₅₁X₅₂GSSSRRAPQT (SEQ ID NO: 123), FNX₅₁X₅₂GSSSRR (SEQ ID NO: 124),FNX₅₁X₅₂GSSSRRA (SEQ ID NO: 125), FNX₅₁X₅₂GSSSRRAP (SEQ ID NO: 126),FNX₅₁X₅₂GSSSRRAPQ (SEQ ID NO: 127), FNX₅₁X₅₂GSSSRRAPQT (SEQ ID NO: 128),FNKX₅₁X₅₂GSSSRR (SEQ ID NO: 129), FNKX₅₁X₅₂GSSSRRA (SEQ ID NO: 130),FNKX₅₁X₅₂GSSSRRAP (SEQ ID NO: 131), FNKX₅₁X₅₂GSSSRRAPQ (SEQ ID NO: 132),FNKX₅₁X₅₂GSSSRRAPQT (SEQ ID NO: 133), FNKPX₅₁X₅₂GSSSRR (SEQ ID NO: 134),FNKPX₅₁X₅₂GSSSRRA (SEQ ID NO: 135), FNKPX₅₁X₅₂GSSSRRAP (SEQ ID NO: 136),FNKPX₅₁X₅₂GSSSRRAPQ (SEQ ID NO: 137), FNKPX₅₁X₅₂GSSSRRAPQT (SEQ ID NO:138), FNKPTX₅₁X₅₂GSSSRR (SEQ ID NO: 139), FNKPTX₅₁X₅₂GSSSRRA (SEQ ID NO:140), FNKPTX₅₁X₅₂GSSSRRAP (SEQ ID NO: 141), FNKPTX₅₁X₅₂GSSSRRAPQ (SEQ IDNO: 142), and FNKPTX₅₁X₅₂GSSSRRAPQT (SEQ ID NO: 143), wherein X₅₁ isselected from the group consisting of glycine, alanine, valine, leucine,isoleucine, proline, phenylalanine and methionine, X₅₂ is any amino acidother than tyrosine and X₄₆ through X₅₆ are each independently any aminoacid or amino acid analog or derivative thereof. In one embodimentlinking moiety of SEQ ID NO: 94-143 is pegylated and in a furtherembodiment an arginine residue of the linking moiety of SEQ ID NO:94-143 is substituted with a pegylated lysine. In one embodiment thepegylated lysine is located at position 8 relative to the native IGF 1 Cpeptide (SEQ ID NO: 17). In one embodiment X₅₂ is any non-aromatic aminoacid. In one embodiment wherein IB comprises sequenceJ-R₂₃—R₂₂—X₂₅LCGX₂₉X₃₀LVX₃₃X₃₄LX₃₆LVCGX₄₁X₄₂GFX₄₅ (SEQ ID NO: 20), LMconsists of the sequence GAGSSSX₅₇X₅₈ (SEQ ID NO: 163), GAGSSSRX₅₈APQ(SEQ ID NO: 167), TGYGSSSRR (SEQ ID NO: 18), TGYGSSSRR (SEQ ID NO: 144),KTGYGSSSRR (SEQ ID NO: 145), PKTGYGSSSRR (SEQ ID NO: 146), TPKTGYGSSSRR(SEQ ID NO: 147), TKPTGYGSSSRR (SEQ ID NO: 148) or SRPAGYGSSSRR (SEQ IDNO: 149), wherein X₅₇ and X₅₈ are independently selected from arginine,ornithine and lysine.

In one embodiment the single chain insulin analog comprises a sequenceselected from the group consisting of

a) J-R₂₃—R₂₂—X₂₅ LCGX₂₉X₃₀LVX₃₃X₃₄LYLVCGX₄₁X₄₂GFX₄₅ (SEQ ID NO:21)-X₅₁X₅₂GSSSRR (SEQ ID NO:27)-GIVX₄X₅CCX₈X₉X₁₀CX₁₂LX₁₄X₁₅LX₁₇X₁₈X₁₉CX₂₁—R₁₃ (SEQ ID NO: 22);

b) J-R₂₃—R₂₂—X₂₅LCGX₂₉X₃₀LVX₃₃X₃₄LYLVCGX₄₁X₄₂GFX₄₅ (SEQ ID NO:21)-X₄₆X₄₇X₄₈X₄₉X₅₀X₅₁X₅₂GSSSRR (SEQ ID NO:86)-GIVX₄X₅CCX₈X₉X₁₀CX₁₂LX₁₄X₁₅LX₁₇X₁₈X₁₉CX₂₁—R₁₃ (SEQ ID NO: 22);

c) J-R₂₃—R₂₂—X₂₅LCGX₂₉X₃₀LVX₃₃X₃₄LYLVCGX₄₁X₄₂GFX₄₅ (SEQ ID NO:21)-X₅₁X₅₂GSSSRR (SEQ ID NO: 27)-APQT (SEQ ID NO:82)-GIVX₄X₅CCX₈X₉X₁₀CX₁₂LX₁₄X₁₅LX₁₇X₁₈X₁₉CX₂₁—R₁₃ (SEQ ID NO: 22);

d) J-R₂₃—R₂₂—X₂₅LCGX₂₉X₃₀LVX₃₃X₃₄LYLVCGX₄₁X₄₂GFX₄₅ (SEQ ID NO:21)-X₄₆X₄₇X₄₈X₄₉X₅₀X₅₁X₅₂GSSSRX₅₈APQT (SEQ ID NO:87)-GIVX₄X₅CCX₈X₉X₁₀CX₁₂LX₁₄X₁₅LX₁₇X₁₈X₁₉CX₂₁—R₁₃ (SEQ ID NO: 22);

e) J-R₂₃—R₂₂—X₂₅LCGX₂₉X₃₀LVEALYLVCGERGFF (SEQ ID NO:53)-X₅₁X₅₂GSSSX₅₇X₅₈APQT (SEQ ID NO: 46)-GIVEQCCX₈SICSLYQLENX₁₉CX₂₁—R₁₃(SEQ ID NO: 55);

f) J-R₂₃R₂₂-X₂₅LCGX₂₉X₃₀LVEALYLVCGERGFF (SEQ ID NO: 53)-X₅₁X₅₂GSSSRR(SEQ ID NO: 27)-GIVEQCCX₈SICSLYQLENX₁₉CX₂₁—R₁₃ (SEQ ID NO: 55);

g) J-R₂₃—R₂₂—X₂₅LCGX₂₉X₃₀LVEALYLVCGERGFF (SEQ ID NO:53)-X₄₆X₄₇X₄₈X₄₉X₅₀X₅₁X₅₂GSSSRX₅₈APQT (SEQ ID NO:87)-GIVEQCCX₈SICSLYQLENX₁₉CX₂₁—R₁₃ (SEQ ID NO: 55);

h) J-R₂₃—R₂₂—X₂₅LCGX₂₉X₃₀LVEALYLVCGERGFF (SEQ ID NO:53)-X₄₆X₄₇X₄₈X₄₉X₅₀X₅₁X₅₂GSSSRR (SEQ ID NO:87)-GIVEQCCX₈SICSLYQLENX₁₉CX₂₁—R₁₃ (SEQ ID NO: 55);

i) J-R₂₃—R₂₂—X₂₅LCGX₂₉X₃₀LVX₃₃X₃₄LYLVCGX₄₁X₄₂GFX₄₅ (SEQ ID NO:21)-GYGSSSRR (SEQ ID NO: 18)-GIVX₄X₅CCX₈X₉X₁₀CX₁₂LX₁₄X₁₅LEX₁₈X₁₉CX₂₁—R₁₃(SEQ ID NO: 152);

j) J-R₂₃—R₂₂—X₂₅LCGX₂₉X₃₀LVX₃₃X₃₄LYLVCGX₄₁X₄₂GFX₄₅ (SEQ ID NO:21)-GYGSSSRR (SEQ ID NO: 18)-GIVEQCCX₈SICSLYQLENX₁₉CX₂₁—R₁₃ (SEQ ID NO:55); and

wherein

J is H or a dipeptide element comprising the general structure of U-B,wherein U is an amino acid or a hydroxyl acid and B is an N-alkylatedamino acid linked through an amide bond;

X₄ is glutamic acid or aspartic acid;

X₅ is glutamine or glutamic acid

X₈ is histidine, threonine or phenylalanine;

X₉ is serine, arginine, lysine, ornithine or alanine;

X₁₀ is isoleucine or serine;

X₁₂ is serine or aspartic acid

X₁₄ is tyrosine, arginine, lysine, ornithine or alanine;

X₁₅ is glutamine, glutamic acid, arginine, alanine, lysine, ornithine orleucine;

X₁₇ is glutamic acid or glutamine;

X₁₈ is methionine, asparagine, glutamine, aspartic acid, glutamic acidor threonine;

X₁₉ is an amino acid of the general structure:

-   -   wherein X is selected from the group consisting of OH or NHR₁₀,        wherein R₁₀ is H or a dipeptide element comprising the general        structure U-B, wherein U is an amino acid or a hydroxyl acid and        B is an N-alkylated amino acid;

X₂₁ is selected from the group consisting of alanine, glycine, serine,valine, threonine, isoleucine, leucine, glutamine, glutamic acid,asparagine, aspartic acid, histidine, tryptophan, tyrosine, andmethionine;

X₂₅ is histidine or threonine;

X₂₉ is selected from the group consisting of alanine, glycine andserine;

X₃₀ is selected from the group consisting of histidine, aspartic acid,glutamic acid, homocysteic acid and cysteic acid;

X₃₃ is selected from the group consisting of aspartic acid, glutamineand glutamic acid;

X₃₄ is selected from the group consisting of alanine and threonine;

X₄₁ is selected from the group consisting of glutamic acid, asparticacid or asparagine;

X₄₂ is selected from the group consisting of alanine, lysine, ornithineand arginine;

X₄₅ is an amino acid of the general structure

-   -   wherein X₁₃ is selected from the group consisting of H, OH and        NHR₁₂, wherein R₁₂ is H or dipeptide element comprising the        general structure U-B;

X₄₆ through X₅₀ are each independently any amino acid or amino acidanalog or derivative thereof;

X₅₁ is selected from the group consisting of glycine, alanine, valine,leucine, isoleucine, proline, phenylalanine and methionine;

X₅₂ is any amino acid other than tyrosine;

X₅₇ and X₅₈ are independently arginine or lysine;

R₂₂ is selected from the group consisting of AYRPSE (SEQ ID NO: 14),FVNQ (SEQ ID NO: 12), PGPE (SEQ ID NO: 11), a tripeptideglycine-proline-glutamic acid, a tripeptide valine-asparagine-glutamine,a dipeptide proline-glutamic acid, a dipeptide asparagine-glutamine,glutamine, glutamic acid and an N-terminal amine;

R₂₃ is a bond or a 1 to 8 amino acid sequence comprising charged aminoacids, including for example negatively charged amino acids; and

R₁₃ is COOH or CONH₂. In one embodiment J is H and X₄₅ is phenylalanineor tyrosine. In a further embodiment R₂₂ is a bond or GX₆₀X₆₁X₆₂X₆₃X₆₄K(SEQ ID NO: 153), wherein X₆₀, X₆₁, X₆₂, X₆₃ and X₆₄ are independentlyglutamic acid or aspartic acid. In a further embodiment the single chaininsulin analog comprises the sequenceR₂₃—R₂₂—X₂₅LCGX₂₉X₃₀LVX₃₃X₃₄LYLVCGX₄₁X₄₂GFX₄₅ (SEQ ID NO:21)-X₅₁X₅₂GSSSRR (SEQ ID NO:27)-GIVX₄X₅CCX₈X₉X₁₀CX₁₂LX₁₄X₁₅LX₁₇X₁₈X₁₉CX₂₁—R₁₃ (SEQ ID NO: 22) orR₂₃—R₂₂—X₂₅LCGX₂₉X₃₀LVX₃₃X₃₄LYLVCGX₄₁X₄₂GFX₄₅ (SEQ ID NO:21)-GX₅₂GSSSRX₅₈APQT (SEQ ID NO:38)-GIVX₄X₅CCX₈X₉X₁₀CX₁₂LX₁₄X₁₅LX₁₇X₁₈X₁₉CX₂₁—R₁₃ (SEQ ID NO: 22)

wherein

X₄ is glutamic acid or aspartic acid;

X₅ is glutamine or glutamic acid

X₈ is histidine, threonine or phenylalanine;

X₉ is serine, arginine, lysine, ornithine or alanine;

X₁₀ is isoleucine or serine;

X₁₂ is serine or aspartic acid

X₁₄ is tyrosine, arginine, lysine, ornithine or alanine;

X₁₅ is glutamine, glutamic acid, arginine, alanine, lysine, ornithine orleucine;

X₁₇ is glutamic acid or glutamine;

X₁₈ is methionine, asparagine, glutamine, aspartic acid, glutamic acidor threonine;

X₁₉ is an amino acid of the general structure:

-   -   wherein X is selected from the group consisting of OH or NHR₁₀,        wherein R₁₀ is H or a dipeptide element comprising the general        structure U-B, wherein U is an amino acid or a hydroxyl acid and        B is an N-alkylated amino acid;

X₂₁ is selected from the group consisting of alanine, glycine, serine,valine, threonine, isoleucine, leucine, glutamine, glutamic acid,asparagine, aspartic acid, histidine, tryptophan, tyrosine, andmethionine;

X₂₅ is histidine or threonine;

X₂₉ is selected from the group consisting of alanine, glycine andserine;

X₃₀ is selected from the group consisting of histidine, aspartic acid,glutamic acid, homocysteic acid and cysteic acid;

X₃₃ is selected from the group consisting of aspartic acid, glutamineand glutamic acid;

X₃₄ is selected from the group consisting of alanine and threonine;

X₄₁ is selected from the group consisting of glutamic acid, asparticacid or asparagine;

X₄₂ is selected from the group consisting of alanine, lysine, ornithineand arginine;

X₄₅ is phenylalanine or tyrosine;

X₅₁ is selected from the group consisting of glycine, alanine, valine,leucine, isoleucine and proline;

X₅₂ is alanine;

X₅₈ is arginine or lysine;

R₂₂ is selected from the group consisting of AYRPSE (SEQ ID NO: 14),FVNQ (SEQ ID NO: 12), PGPE (SEQ ID NO: 11), a tripeptideglycine-proline-glutamic acid, a tripeptide valine-asparagine-glutamine,a dipeptide proline-glutamic acid, a dipeptide asparagine-glutamine,glutamine, glutamic acid and an N-terminal amine;

R₂₃ is a bond or a 1 to 8 amino acid sequence comprising one or morenegatively charged amino acids; and

R₁₃ is COOH or CONH₂. In one embodiment R₂₃ is an N-terminal amine, apeptide comprising X₆₀(X₆₁X₆₂)_(d)X₆₃K (SEQ ID NO: 192), wherein d is aninteger ranging from 1-3, or a peptide selected form the groupconsisting of GEK, GEEK (SEQ ID NO: 179), GEEEK (SEQ ID NO: 178),GEEEEEK (SEQ ID NO: 177), GEEEEEK (SEQ ID NO: 176), GEEEEEEEK (SEQ IDNO: 175), GDK, GDDK (SEQ ID NO: 190), GDDDK (SEQ ID NO: 189), GDDDDK(SEQ ID NO: 188), GDDDDDK (SEQ ID NO: 187), GDDDDDK (SEQ ID NO: 186),GERK (SEQ ID NO: 174), GEERK (SEQ ID NO: 173), GEEERK (SEQ ID NO: 172),GEEEEERK (SEQ ID NO: 171), GEEEEERK (SEQ ID NO: 170), GEEEEEEERK (SEQ IDNO: 169), GDRK (SEQ ID NO: 185, GDDRK (SEQ ID NO: 184), GDDDRK (SEQ IDNO: 183), GDDDDRK (SEQ ID NO: 182), GDDDDDRK (SEQ ID NO: 181), orGDDDDDRK (SEQ ID NO: 180). In accordance with one embodiment thedipeptide element U-B comprises the structure of Formula X:

wherein

(a) R₁, R₂, R₄ and R₈ are independently selected from the groupconsisting of H, C₁-C₁₈ alkyl, C₂-C₁₈ alkenyl, (C₁-C₁₈ alkyl)OH, (C₁-C₁₈alkyl)SH, (C₂-C₃ alkyl)SCH₃, (C₁-C₄ alkyl)CONH₂, (C₁-C₄ alkyl)COOH,(C₁-C₄ alkyl)NH₂, (C₁-C₄ alkyl)NHC(NH₂ ⁺)NH₂, (C₀-C₄ alkyl)(C₃-C₆cycloalkyl), (C₀-C₄ alkyl)(C₂-C₅ heterocyclic), (C₀-C₄ alkyl)(C₆-C₁₀aryl)R₇, (C₁-C₄ alkyl)(C₃-C₉ heteroaryl), and C₁-C₁₂ alkyl(W1)C₁-C₁₂alkyl, wherein W1 is a heteroatom selected from the group consisting ofN, S and O, or

-   -   (ii) R₁ and R₂ together with the atoms to which they are        attached form a C3-C12 cycloalkyl or aryl; or    -   (iii) R₄ and R₈ together with the atoms to which they are        attached form a C₃-C₆ cycloalkyl;

(b) R₃ is selected from the group consisting of C₁-C₁₈ alkyl, (C₁-C₁₈alkyl)OH, (C₁-C₁₈ alkyl)NH₂, (C₁-C₁₈ alkyl)SH, (C₀-C₄alkyl)(C₃-C₆)cycloalkyl, (C₀-C₄ alkyl)(C₂-C₅ heterocyclic), (C₀-C₄alkyl)(C₆-C₁₀ aryl)R₇, and (C₁-C₄ alkyl)(C₃-C₉ heteroaryl) or R₄ and R₃together with the atoms to which they are attached form a 4, 5 or 6member heterocyclic ring;

(c) R₅ is NHR₆ or OH;

(d) R₆ is H, C₁-C₈ alkyl or R₆ and R₂ together with the atoms to whichthey are attached form a 4, 5 or 6 member heterocyclic ring; and

(e) R₇ is selected from the group consisting of H and OH.

In accordance with one embodiment the dipeptide element U-B comprisesthe structure:

wherein

R₁, R₂, R₄ and R₈ are independently selected from the group consistingof H, C₁-C₁₈ alkyl, C₂-C₁₈ alkenyl, (C₁-C₁₈ alkyl)OH, (C₁-C₁₈ alkyl)SH,(C₂-C₃ alkyl)SCH₃, (C₁-C₄ alkyl)CONH₂, (C₁-C₄ alkyl)COOH, (C₁-C₄alkyl)NH₂, (C₁-C₄ alkyl)NHC(NH₂ ⁺)NH₂, (C₀-C₄ alkyl)(C₃-C₆ cycloalkyl),(C₀-C₄ alkyl)(C₂-C₅ heterocyclic), (C₀-C₄ alkyl)(C₆-C₁₀ aryl)R₇, (C₁-C₄alkyl)(C₃-C₉ heteroaryl), and C₁-C₁₂ alkyl(W₁)C₁-C₁₂ alkyl, wherein W₁is a heteroatom selected from the group consisting of N, S and O, or R₁and R₂ together with the atoms to which they are attached form a C₃-C₁₂cycloalkyl; or R₄ and R₈ together with the atoms to which they areattached form a C₃-C₆ cycloalkyl;

R₃ is selected from the group consisting of C₁-C₁₈ alkyl, (C₁-C₁₈alkyl)OH, (C₁-C₁₈ alkyl)NH₂, (C₁-C₁₈ alkyl)SH, (C₀-C₄alkyl)(C₃-C₆)cycloalkyl, (C₀-C₄ alkyl)(C₂-C₅ heterocyclic), (C₀-C₄alkyl)(C₆-C₁₀ aryl)R₇, and (C₁-C₄ alkyl)(C₃-C₉ heteroaryl) or R₄ and R₃together with the atoms to which they are attached form a 4, 5 or 6member heterocyclic ring;

R₅ is NHR₆ or OH;

R₆ is H, C₁-C₈ alkyl or R₆ and R₁ together with the atoms to which theyare attached form a 4, 5 or 6 member heterocyclic ring; and

R₇ is selected from the group consisting of hydrogen, C₁-C₁₈ alkyl,C₂-C₁₈ alkenyl, (C₀-C₄ alkyl)CONH₂, (C₀-C₄ alkyl)COOH, (C₀-C₄ alkyl)NH₂,(C₀-C₄ alkyl)OH, and halo. In one specific embodiment the dipeptideelement comprises the structure of Formula X wherein

-   -   R₁ and R₂ are independently C1-C18 alkyl or aryl;    -   R₃ is C₁-C₁₈ alkyl or R₃ and R₄ together with the atoms to which        they are attached form a 4-12 heterocyclic ring;    -   R₄ and R₈ are independently selected from the group consisting        of hydrogen, C₁-C₁₈ alkyl and aryl; and    -   R₅ is an amine or a hydroxyl.

In another specific embodiment the dipeptide element comprises thestructure of Formula X wherein

-   -   R₁ is selected from the group consisting of hydrogen, C₁-C₁₈        alkyl and aryl, or R₁ and R₂ are linked through —(CH2)p-,        wherein p is 2-9;    -   R₃ is C₁-C₁₈ alkyl or R₃ and R₄ together with the atoms to which        they are attached form a 4-6 heterocyclic ring;    -   R₄ and R₈ are independently selected from the group consisting        of hydrogen, C₁-C₁₈ alkyl and aryl; and    -   R₅ is an amine or N-substituted amine.

In another specific embodiment the dipeptide element comprises thestructure of Formula X wherein

-   -   R₁ and R₂ are independently selected from the group consisting        of hydrogen, C₁-C₈ alkyl and aryl;    -   R₃ is C₁-C₁₈ alkyl or R₃ and R₄ together with the atoms to which        they are attached form a 4-6 heterocyclic ring;    -   R₄ and R₈ are each hydrogen; and    -   R₅ is selected from the group consisting of amine, N-substituted        amine and hydroxyl.

In one embodiment a prodrug form of a single chain insulin analogcomprises the structure IB-LM-IA, wherein IB comprises sequence

J-R₂₃—R₂₂—X₂₅LCGX₂₉X₃₀LVX₃₃X₃₄LX₃₆LVCGX₄₁X₄₂GFX₄₅ (SEQ ID NO: 20);

LM comprises the structure:

and

IA comprises the sequence GIVX₄X₅CCX₈X₉X₁₀CX₁₂LX₁₄X₁₅LX₁₇X₁₈X₁₉CX₂₁—R₁₃(SEQ ID NO: 22), wherein m is an integer ranging from 5 to 15. In afurther embodiment m is an integer ranging from 7 to 13.

In one embodiment the single chain insulin analog comprises a sequenceselected from the group consisting of

a) J-R₂₃—R₂₂—X₂₅LCGX₂₉X₃₀LVX₃₃X₃₄LYLVCGX₄₁X₄₂GFX₄₅ (SEQ ID NO:21)-(PEG)₈₋₁₄-GIVX₄X₅CCX₈X₉X₁₀CX₁₂LX₁₄X₁₅LX₁₇X₁₈X₁₉CX₂₁—R₁₃ (SEQ ID NO:22)

b) J-R₂₃—R₂₂—X₂₅LCGX₂₉X₃₀LVEALYLVCGERGFF (SEQ ID NO:53)-(PEG)₈₋₁₄-GIVEQCCX₈SICSLYQLENX₁₉CX₂₁—R₁₃ (SEQ ID NO: 55)

c) J-R₂₃—R₂₂—X₂₅LCGX₂₉X₃₀LVEALYLVCGERGFF (SEQ ID NO:53)-(PEG)₈₋₁₄-GIVX₄X₅CCX₈X₉X₁₀CX₁₂LX₁₄X₁₅LX₁₇X₁₈X₁₉CX₂₁—R₁₃ (SEQ ID NO:22); and

d) J-R₂₃—R₂₂—X₂₅LCGX₂₉X₃₀LVX₃₃X₃₄LYLVCGX₄₁X₄₂GFX₄₅ (SEQ ID NO:21)-(PEG)₈₋₁₄-GIVEQCCX₈SICSLYQLENX₁₉CX₂₁—R₁₃ (SEQ ID NO: 55) wherein

J is H or a dipeptide element comprising the general structure of U-B,wherein U is an amino acid or a hydroxyl acid and B is an N-alkylatedamino acid linked through an amide bond;

X₄ is glutamic acid or aspartic acid;

X₅ is glutamine or glutamic acid

X₈ is histidine, threonine or phenylalanine;

X₉ is serine, arginine, lysine, ornithine or alanine;

X₁₀ is isoleucine or serine;

X₁₂ is serine or aspartic acid

X₁₄ is tyrosine, arginine, lysine, ornithine or alanine;

X₁₅ is glutamine, glutamic acid, arginine, alanine, lysine, ornithine orleucine;

X₁₇ is glutamic acid or glutamine;

X₁₈ is methionine, asparagine, glutamine, aspartic acid, glutamic acidor threonine;

X₁₉ is an amino acid of the general structure:

-   -   wherein X is selected from the group consisting of OH or NHR₁₀,        wherein R₁₀ is H or a dipeptide element comprising the general        structure U-B, wherein U is an amino acid or a hydroxyl acid and        B is an N-alkylated amino acid;

X₂₁ is selected from the group consisting of alanine, glycine, serine,valine, threonine, isoleucine, leucine, glutamine, glutamic acid,asparagine, aspartic acid, histidine, tryptophan, tyrosine, andmethionine;

X₂₅ is histidine or threonine;

X₂₉ is selected from the group consisting of alanine, glycine andserine;

X₃₀ is selected from the group consisting of histidine, aspartic acid,glutamic acid, homocysteic acid and cysteic acid;

X₃₃ is selected from the group consisting of aspartic acid, glutamineand glutamic acid;

X₃₄ is selected from the group consisting of alanine and threonine;

X₄₁ is selected from the group consisting of glutamic acid, asparticacid or asparagine;

X₄₂ is selected from the group consisting of alanine, lysine, ornithineand arginine;

X₄₅ is an amino acid of the general structure

-   -   wherein X₁₃ is selected from the group consisting of H, OH and        NHR₁₂, wherein R₁₂ is H or dipeptide element comprising the        general structure U-B;

R₂₂ is selected from the group consisting of AYRPSE (SEQ ID NO: 14),FVNQ (SEQ ID NO: 12), PGPE (SEQ ID NO: 11), a tripeptideglycine-proline-glutamic acid, a tripeptide valine-asparagine-glutamine,a dipeptide proline-glutamic acid, a dipeptide asparagine-glutamine,glutamine, glutamic acid and a bond;

R₂₃ is a bond or G(X₆₀)_(d)(X₆₁)_(g)K (SEQ ID NO: 191)

-   -   wherein X₆₀, X₆₁ are independently glutamic acid or aspartic        acid; and    -   d and g are integers independently ranging from 1-6; and

R₁₃ is COOH or CONH₂. As used herein the designation “(PEG)_(n)” isintended to represent a polyethylene glycol having the number ofmonomers indicated by the subscript number, or range of numbers,provided outside the parenthesis. In one embodiment J is H and X₄₅ isphenylalanine or tyrosine. In a further embodiment R₂₃ is a bond orGX₆₀X₆₁X₆₂X₆₃X₆₄K (SEQ ID NO: 193, wherein X₆₀, X₆₁, X₆₂, X₆₃ and X₆₄are independently glutamic acid or aspartic acid.

In accordance with one embodiment a prodrug form of a single chaininsulin analog comprises the structure IB-LM-IA, wherein IB comprisesthe sequence R₂₂HLCGSX₃₀LVEALYLVCG ERGFF (SEQ ID NO: 154) orX₂₂VNQX₂₅LCGX₂₉X₃₀LVEALYLVCGERGFFYT-Z₁-B₁ (SEQ ID NO: 54); LM is alinking moiety as disclosed herein and IA comprises the sequenceGIVEQCCX₈SICSLYQLENX₁₉CX₂₁—R₁₃(SEQ ID NO: 55) orGIVX₄ECCX₈X₉SCDLX₁₄X₁₅LX₁₇X₁₈X₁₉CX₂₁—R₁₃ (SEQ ID NO: 19), wherein

X₄ is glutamic acid or aspartic acid;

X₈ is histidine, threonine or phenylalanine;

X₉ is arginine, lysine, ornithine or alanine;

X₁₄ is arginine, lysine, ornithine or alanine;

X₁₅ is glutamine, glutamic acid, arginine, alanine, lysine, ornithine orleucine;

X₁₇ is glutamic acid or glutamine;

X₁₈ is methionine, asparagine, glutamine, aspartic acid, glutamic acidor threonine;

X₁₉ is an amino acid of the general structure

wherein

-   -   m₁ is an integer ranging from 1 to 3;    -   R₁, R₂, R₄ and R₈ are independently selected from the group        consisting of H, C₁-C₁₈ alkyl, C₂-C₁₈ alkenyl, (C₁-C₁₈ alkyl)OH,        (C₁-C₁₈ alkyl)SH, (C₂-C₃ alkyl)SCH₃, (C₁-C₄ alkyl)CONH₂, (C₁-C₄        alkyl)COOH, (C₁-C₄ alkyl)NH₂, (C₁-C₄ alkyl)NHC(NH₂ ⁺)NH₂, (C₀-C₄        alkyl)(C₃-C₆ cycloalkyl), (C₀-C₄ alkyl)(C₂-C₅ heterocyclic),        (C₀-C₄ alkyl)(C₆-C₁₀ aryl)R₇, (C₁-C₄ alkyl)(C₃-C₉ heteroaryl),        and C₁-C₁₂ alkyl(W₁)C₁-C₁₂ alkyl, wherein W₁ is a heteroatom        selected from the group consisting of N, S and O, or R₁ and R₂        together with the atoms to which they are attached form a C₃-C₁₂        cycloalkyl; or R₄ and R₈ together with the atoms to which they        are attached form a C₃-C₆ cycloalkyl;    -   R₃ is selected from the group consisting of C₁-C₁₈ alkyl,        (C₁-C₁₈ alkyl)OH, (C₁-C₁₈ alkyl)NH₂, (C₁-C₁₈ alkyl)SH, (C₀-C₄        alkyl)(C₃-C₆)cycloalkyl, (C₀-C₄ alkyl)(C₂-C₅ heterocyclic),        (C₀-C₄ alkyl)(C₆-C₁₀ aryl)R₇, and (C₁-C₄ alkyl)(C₃-C₉        heteroaryl) or R₄ and R₃ together with the atoms to which they        are attached form a 4, 5 or 6 member heterocyclic ring;    -   R₅ is NHR₆ or OH;    -   R₆ is H, C₁-C₈ alkyl or R₆ and R₁ together with the atoms to        which they are attached form a 4, 5 or 6 member heterocyclic        ring; and    -   R₇ is selected from the group consisting of hydrogen, C₁-C₁₈        alkyl, C₂-C₁₈ alkenyl, (C₀-C₄ alkyl)CONH₂, (C₀-C₄ alkyl)COOH,        (C₀-C₄ alkyl)NH₂, (C₀-C₄ alkyl)OH, and halo;

X₂₁ is selected from the group consisting of alanine, glycine, serine,valine, threonine, isoleucine, leucine, glutamine, glutamic acid,asparagine, aspartic acid, histidine, tryptophan, tyrosine, andmethionine;

X₂₂ is selected from the group consisting of phenylalanine anddesamino-phenylalanine;

X₂₃ is asparagine or glycine;

X₂₅ is histidine or threonine;

X₂₉ is selected from the group consisting of alanine, glycine andserine;

X₃₀ is selected from the group consisting of histidine, aspartic acid,glutamic acid, homocysteic acid and cysteic acid;

Z₁ is a dipeptide selected from the group consisting ofaspartate-lysine, lysine-proline, and proline-lysine; and

B₁ is selected from the group consisting of threonine, alanine or athreonine-arginine-arginine tripeptide.

R₂₂ is selected from the group consisting of X₂₂VNQ (SEQ ID NO: 84), atripeptide valine-asparagine-glutamine, a dipeptideasparagine-glutamine, glutamine, and an N-terminal amine; and

R₁₃ is COOH or CONH₂. In one embodiment m₁ is 1. In accordance with oneembodiment the single chain insulin analog comprises the structureIB-LM-IA, wherein IB comprises the sequenceX₂₂VNQX₂₅LCGX₂₉X₃₀LVEALYLVCGERGFFYT-Z₁-B₁ (SEQ ID NO: 54), LM is alinking moiety as disclosed herein and IA comprises the sequenceGIVX₄ECCX₈X₉SCDLX₁₄X₁₅LX₁₇X₁₈X₁₉CX₂₁—R₁₃ (SEQ ID NO: 19).

In one embodiment the single chain insulin analog comprises a compoundof the formula: IB-LM-IA, wherein IB represents an IGF YL B chaincomprising the sequence GPETLCGX₂₆ELVDX₂₇LYLVCGDX₄₂GFYFNKPT-R₁₄ (SEQ IDNO: 197), wherein X₂₆ and X₂₇ are each alanine and X₄₂ is arginine, orGPETLCGAELVDALYLVCGDRGFYFNPKT (SEQ ID NO: 89), LM represents a linkingmoiety as disclosed herein and IA represents an IGF A chain comprisingthe sequence GIVDECCHRSCDLRRLEMX₁₉CA-R₁₃ (SEQ ID NO: 155) orGIVDECCHOSCDLOOLQMX₁₉CN—R₁₃ (SEQ ID NO: 75) or the native insulinsequence GIVEQCCTSICSLYQLENX₁₉CN—R₁₃ (SEQ ID NO: 194) wherein

X₈ is histidine or phenylalanine;

X₁₉ is an amino acid of the general structure

-   -   wherein X is selected from the group consisting of OH or NHR₁₀,        wherein R₁₀ is a dipeptide comprising the general structure:

-   -   R₁ is selected from the group consisting of H and C₁-C₈ alkyl;    -   R₂ and R₄ are independently selected from the group consisting        of H, C₁-C₈ alkyl, C₂-C₈ alkenyl, (C₁-C₄ alkyl)OH, (C₁-C₄        alkyl)SH, (C₂-C₃ alkyl)SCH₃, (C₁-C₄ alkyl)CONH₂, (C₁-C₄        alkyl)COOH, (C₁-C₄ alkyl)NH₂, (C₁-C₄ alkyl)NHC(NH₂ ⁺) NH₂,        (C₀-C₄ alkyl)(C₃-C₆ cycloalkyl), (C₀-C₄ alkyl)(C₆-C₁₀ aryl)R₇,        and CH₂(C₅-C₉ heteroaryl), or R₁ and R₂ together with the atoms        to which they are attached form a C₃-C₆ cycloalkyl;    -   R₃ is selected from the group consisting of C₁-C₈ alkyl, (C₁-C₄        alkyl)OH, (C₁-C₄ alkyl)NH₂, (C₁-C₄ alkyl)SH, and        (C₃-C₆)cycloalkyl or R₄ and R₃ together with the atoms to which        they are attached form a 5 or 6 member heterocyclic ring;    -   R₅ is NHR₆ or OH;    -   R₆ is H, or R₆ and R₂ together with the atoms to which they are        attached form a 5 or 6 member heterocyclic ring;    -   R₇ is selected from the group consisting of H and OH; and    -   R₈ is H; and

R₁₃ and R₁₄ are independently COOH or CONH₂. The present invention alsoencompasses any combination of insulin analog A chain and B chainpeptides, as disclosed herein, linked together via a linking moiety asdisclosed herein as a single chain insulin analog of the formulaIB-LM-IA.

The Dipeptide Prodrug Element

The substituents of the dipeptide prodrug element, and its site ofattachment to the single chain insulin analog, can be selected toprovide the desired half life of a prodrug analog of the single chaininsulin analogs disclosed herein. For example, when a dipeptide prodrugelement comprising the structure:

is linked to the alpha amino group of the N-terminal amino acid of thesingle chain insulin analog B chain, compounds having a t_(1/2) of about1 hour in PBS under physiological conditions are provided when

R₁ and R₂ are independently C₁-C₁₈ alkyl or aryl; or R₁ and R₂ arelinked through —(CH₂)_(p)—, wherein p is 2-9;

R₃ is C₁-C₁₈ alkyl;

R₄ and R₈ are each hydrogen; and

R₅ is an amine.

In other embodiments, prodrugs linked at the N-terminus and having at_(1/2) of, e.g., about 1 hour comprise a dipeptide prodrug element withthe structure:

wherein

R₁ and R₂ are independently C₁-C₁₈ alkyl or (C₀-C₄ alkyl)(C₆-C₁₀aryl)R₇; or R₁ and R₂ are linked through —(CH₂)_(p), wherein p is 2-9;

R₃ is C₁-C₁₈ alkyl;

R₄ and R₈ are each hydrogen;

R₅ is NH₂;

R₇ is selected from the group consisting of hydrogen, C₁-C₁₈ alkyl,C₂-C₁₈ alkenyl, (C₀-C₄ alkyl)CONH₂, (C₀-C₄ alkyl)COOH, (C₀-C₄ alkyl)NH₂,(C₀-C₄ alkyl)OH, and halo; and R₈ is H.

Alternatively, in one embodiment a single chain insulin analog prodrugderivative is provided wherein the dipeptide prodrug is linked to thealpha amino group of the N-terminal amino acid of the single chaininsulin analog B chain, and the prodrug has a t_(1/2) between about 6 toabout 24 hours in PBS under physiological conditions. In one embodimenta single chain insulin analog prodrug derivative having a t_(1/2)between about 6 to about 24 hours in PBS under physiological conditionsis provided wherein the prodrug element has the structure of Formula Xand

R₁ and R₂ are independently selected from the group consisting ofhydrogen, C₁-C₁₈ alkyl and aryl, or R₁ and R₂ are linked through—(CH₂)_(p)—, wherein p is 2-9;

R₃ is C₁-C₁₈ alkyl or R₃ and R₄ together with the atoms to which theyare attached form a 4-12 heterocyclic ring;

R₄ and R₈ are independently selected from the group consisting ofhydrogen, C₁-C₈ alkyl and aryl; and

R₅ is an amine, with the proviso that both R₁ and R₂ are not hydrogenand provided that one of R₄ or R₈ is hydrogen.

In a further embodiment a single chain insulin analog prodrug derivativeis provided wherein the dipeptide prodrug is linked to the alpha aminogroup of the N-terminal amino acid of the single chain insulin analog Bchain, and the prodrug has a t_(1/2) between about 72 to about 168 hoursin PBS under physiological conditions. In one embodiment a single chaininsulin analog prodrug derivative having a t_(1/2) between about 72 toabout 168 hours in PBS under physiological conditions is providedwherein the prodrug element has the structure of Formula X and

R₁ is selected from the group consisting of hydrogen, C₁-C₈ alkyl andaryl;

R₂ is H;

R₃ is C₁-C₁₈ alkyl;

R₄ and R₈ are each hydrogen; and

R₅ is an amine or N-substituted amine or a hydroxyl;

with the proviso that, if R₁ is alkyl or aryl, then R₁ and R₅ togetherwith the atoms to which they are attached form a 4-11 heterocyclic ring.

In some embodiments, prodrugs having the dipeptide prodrug elementlinked to the N-terminal alpha amino acid of the single chain insulinanalog B chain peptide and having a t_(1/2), e.g., between about 12 toabout 72 hours, or in some embodiments between about 12 to about 48hours, comprise a dipeptide prodrug element with the structure:

wherein R₁ and R₂ are independently selected from the group consistingof hydrogen, C₁-C₁₈ alkyl, (C₁-C₁₈ alkyl)OH, (C₁-C₄ alkyl)NH₂, and(C₀-C₄ alkyl)(C₆-C₁₀ aryl)R₇, or R₁ and R₂ are linked through (CH₂)_(p),wherein p is 2-9;

R₃ is C₁-C₁₈ alkyl or R₃ and R₄ together with the atoms to which theyare attached form a 4-12 heterocyclic ring;

R₄ and R₈ are independently selected from the group consisting ofhydrogen, C₁-C₈ alkyl and (C₀-C₄ alkyl)(C₆-C₁₀ aryl)R₇;

R₅ is NH₂; and

R₇ is selected from the group consisting of H, C₁-C₁₈ alkyl, C₂-C₁₈alkenyl, (C₀-C₄ alkyl)CONH₂, (C₀-C₄ alkyl)COOH, (C₀-C₄ alkyl)NH₂, (C₀-C₄alkyl)OH, and halo;

with the proviso that both R₁ and R₂ are not hydrogen and provided thatat least one of R₄ or R₈ is hydrogen.

In some embodiments, prodrugs having the dipeptide prodrug elementlinked to the N-terminal amino acid of the single chain insulin analog Bchain peptide and having a t_(1/2), e.g., between about 12 to about 72hours, or in some embodiments between about 12 to about 48 hours,comprise a dipeptide prodrug element with the structure:

wherein R₁ and R₂ are independently selected from the group consistingof hydrogen, C₁-C₈ alkyl and (C₁-C₄ alkyl)NH₂, or R₁ and R₂ are linkedthrough (CH₂)_(p), wherein p is 2-9;

R₃ is C₁-C₈ alkyl or R₃ and R₄ together with the atoms to which they areattached form a 4-6 heterocyclic ring;

R₄ is selected from the group consisting of hydrogen and C₁-C₈ alkyl;and

R₅ is NH₂;

with the proviso that both R₁ and R₂ are not hydrogen.

In other embodiments, prodrugs having the dipeptide prodrug elementlinked to the N-terminal amino acid of the single chain insulin analog Bchain peptide and having a t_(1/2), e.g., between about 12 to about 72hours, or in some embodiments between about 12 to about 48 hours,comprise a dipeptide prodrug element with the structure:

wherein

R₁ and R₂ are independently selected from the group consisting ofhydrogen, C₁-C₈ alkyl and (C₁-C₄ alkyl)NH₂;

R₃ is C₁-C₆ alkyl;

R₄ is hydrogen; and

R₅ is NH₂;

with the proviso that both R₁ and R₂ are not hydrogen.

In some embodiments, prodrugs having the dipeptide prodrug elementlinked to the N-terminal amino acid of the single chain insulin analog Bchain peptide and having a t_(1/2), e.g., between about 12 to about 72hours, or in some embodiments between about 12 to about 48 hours,comprise a dipeptide prodrug element with the structure:

wherein

R₁ and R₂ are independently selected from the group consisting ofhydrogen and C₁-C₈ alkyl, (C₁-C₄ alkyl)NH₂, or R₁ and R₂ are linkedthrough (CH₂)_(p), wherein p is 2-9;

R₃ is C₁-C₈ alkyl;

R₄ is (C₀-C₄ alkyl)(C₆-C₁₀ aryl)R₇;

R₅ is NH₂; and

R₇ is selected from the group consisting of hydrogen, C₁-C₈ alkyl and(C₀-C₄ alkyl)OH;

with the proviso that both R₁ and R₂ are not hydrogen.

In addition a prodrug having the dipeptide prodrug element linked to theN-terminal alpha amino acid of the single chain insulin analog andhaving a t_(1/2), e.g., of about 72 to about 168 hours is providedwherein the dipeptide prodrug element has the structure:

wherein R₁ is selected from the group consisting of hydrogen, C₁-C₈alkyl and (C₀-C₄ alkyl)(C₆-C₁₀ aryl)R₇;

R₃ is C₁-C₁₈ alkyl;

R₄ and R₈ are each hydrogen;

R₅ is NHR₆ or OH;

R₆ is H, C₁-C₈ alkyl, or R₆ and R₁ together with the atoms to which theyare attached form a 4, 5 or 6 member heterocyclic ring; and

R₇ is selected from the group consisting of hydrogen, C₁-C₁₈ alkyl,C₂-C₁₈ alkenyl, (C₀-C₄ alkyl)CONH₂, (C₀-C₄ alkyl)COOH, (C₀-C₄ alkyl)NH₂,(C₀-C₄ alkyl)OH, and halo;

with the proviso that, if R₁ is alkyl or (C₀-C₄ alkyl)(C₆-C₁₀ aryl)R₇,then R₁ and R₅ together with the atoms to which they are attached form a4-11 heterocyclic ring.

In some embodiments the dipeptide prodrug element is linked to a sidechain amine of an internal amino acid of the single chain insulinanalog. In this embodiment prodrugs having a t_(1/2), e.g., of about 1hour have the structure:

wherein

R₁ and R₂ are independently C₁-C₈ alkyl or (C₀-C₄ alkyl)(C₆-C₁₀ aryl)R₇;or R₁ and R₂ are linked through —(CH₂)_(p)—, wherein p is 2-9;

R₃ is C₁-C₁₈ alkyl;

R₄ and R₈ are each hydrogen;

R₅ is NH₂; and

R₇ is selected from the group consisting of hydrogen, C₁-C₁₈ alkyl,C₂-C₁₈ alkenyl, (C₀-C₄ alkyl)CONH₂, (C₀-C₄ alkyl)COOH, (C₀-C₄ alkyl)NH₂,(C₀-C₄ alkyl)OH, and halo.

Furthermore, prodrugs having a t_(1/2), e.g., between about 6 to about24 hours and having the dipeptide prodrug element linked to an internalamino acid side chain are provided wherein the prodrug comprises adipeptide prodrug element with the structure:

wherein R₁ and R₂ are independently selected from the group consistingof hydrogen, C₁-C₈ alkyl, and (C₀-C₄ alkyl)(C₆-C₁₀ aryl)R₇, or R₁ and R₂are linked through —(CH₂)_(p)—, wherein p is 2-9;

R₃ is C₁-C₁₈ alkyl or R₃ and R₄ together with the atoms to which theyare attached form a 4-12 heterocyclic ring;

R₄ and R₈ are independently hydrogen, C₁-C₁₈ alkyl or (C₀-C₄alkyl)(C₆-C₁₀ aryl)R₇; R₅ is NHR₆;

R₆ is H or C₁-C₈ alkyl, or R₆ and R₂ together with the atoms to whichthey are attached form a 4, 5 or 6 member heterocyclic ring; and

R₇ is selected from the group consisting of hydrogen, C₁-C₁₈ alkyl,C₂-C₁₈ alkenyl, (C₀-C₄ alkyl)CONH₂, (C₀-C₄ alkyl)COOH, (C₀-C₄ alkyl)NH₂,(C₀-C₄ alkyl)OH, and halo;

with the proviso that both R₁ and R₂ are not hydrogen and provided thatat least one of R₄ or R₈ is hydrogen.

In addition a prodrug having a t_(1/2), e.g., of about 72 to about 168hours and having the dipeptide prodrug element linked to a internalamino acid side chain of the single chain insulin analog is providedwherein the dipeptide prodrug element has the structure:

wherein R₁ is selected from the group consisting of hydrogen, C₁-C₁₈alkyl and (C₀-C₄ alkyl)(C₆-C₁₀ aryl)R₇;

R₃ is C₁-C₁₈ alkyl;

R₄ and R₈ are each hydrogen;

R₅ is NHR₆ or OH;

R₆ is H or C₁-C₈ alkyl, or R₆ and R₁ together with the atoms to whichthey are attached form a 4, 5 or 6 member heterocyclic ring; and

R₇ is selected from the group consisting of hydrogen, C₁-C₁₈ alkyl,C₂-C₁₈ alkenyl, (C₀-C₄ alkyl)CONH₂, (C₀-C₄ alkyl)COOH, (C₀-C₄ alkyl)NH₂,(C₀-C₄ alkyl)OH, and halo; with the proviso that, if R₁ and R₂ are bothindependently an alkyl or (C₀-C₄ alkyl)(C₆-C₁₀ aryl)R₇, either R₁ or R₂is linked through (CH₂)_(p) to R₅, wherein p is 2-9.

In some embodiments the dipeptide prodrug element is linked to a sidechain amine of an internal amino acid of the single chain insulin analogwherein the internal amino acid comprises the structure of Formula V

wherein

n is an integer selected from 1 to 4. In some embodiments n is 3 or 4and in some embodiments the internal amino acid is lysine. In someembodiments the dipeptide prodrug element is linked to a primary amineon a side chain of an amino acid located at position 28, or 29 of theB-chain of the single chain insulin analog.

In embodiments where the dipeptide prodrug element of formula X islinked to an amino substituent of an aryl group of an aromatic aminoacid, prodrug, the substituents of the prodrug element can be selectedto provide the desired time of activation. For example, the half life ofa prodrug analog of any of the single chain insulin analogs disclosedherein comprising an amino acid of the structure of Formula IV:

wherein m₁ is an integer from 0 to 3, can be selected by altering thesubstituents of R₁, R₂, R₃, R₄, R₅, and R₈. In one embodiment the aminoacid of formula V is present at an amino acid corresponding to positionA19, B16 or B25 of native insulin, and in one specific example the aminoacid of formula V is located at position A19 of the single chain insulinanalog, and m₁ is 1. In one embodiment a single chain insulin analogprodrug derivative comprising the structure of Formula IV and having at½ of about 1 hour in PBS under physiological conditions is provided. Inone embodiment the single chain insulin analog prodrug derivative havinga t½ of about 1 hour in PBS under physiological conditions comprises thestructure of formula IV wherein,

R₁ and R₂ are independently C₁-C₁₈ alkyl or aryl;

R₃ is C₁-C₁₈ alkyl or R₃ and R₄ together with the atoms to which theyare attached form a 4-12 heterocyclic ring;

R₄ and R₈ are independently selected from the group consisting ofhydrogen, C₁-C₁₈ alkyl and aryl; and

R₅ is an amine or a hydroxyl. In one embodiment m₁ is 1.

In one embodiment, the dipeptide prodrug element is linked to the singlechain insulin analog via an amine present on an aryl group of anaromatic amino acid of the single chain insulin analog, wherein theprodrug has a t_(1/2), e.g., of about 1 hour has a dipeptide structureof:

wherein R₁ and R₂ are independently C₁-C₁₈ alkyl or (C₀-C₄ alkyl)(C₆-C₁₀aryl)R₇;

R₃ is C₁-C₁₈ alkyl or R₃ and R₄ together with the atoms to which theyare attached form a 4-12 heterocyclic ring;

R₄ and R₈ are independently selected from the group consisting ofhydrogen, C₁-C₁₈ alkyl and (C₀-C₄ alkyl)(C₆-C₁₀ aryl)R₇;

R₅ is NH₂ or OH; and

R₇ is selected from the group consisting of hydrogen, C₁-C₁₈ alkyl,C₂-C₁₈ alkenyl, (C₀-C₄ alkyl)CONH₂, (C₀-C₄ alkyl)COOH, (C₀-C₄ alkyl)NH₂,(C₀-C₄ alkyl)OH, and halo.

In another embodiment a single chain insulin analog prodrug derivativecomprising the structure of Formula IV, wherein m₁ is an integer from 0to 3 and having a t½ of about 6 to about 24 hours in PBS underphysiological conditions, is provided. In one embodiment where thesingle chain insulin analog prodrug having a t½ of about 6 to about 24hours in PBS under physiological conditions comprises the structure offormula IV wherein,

R₁ is selected from the group consisting of hydrogen, C₁-C₁₈ alkyl andaryl, or R₁ and R₂ are linked through —(CH₂)_(p)—, wherein p is 2-9;

R₃ is C₁-C₁₈ alkyl or R₃ and R₄ together with the atoms to which theyare attached form a 4-6 heterocyclic ring;

R₄ and R₈ are independently selected from the group consisting ofhydrogen, C₁-C₁₈ alkyl and aryl; and

R₅ is an amine or N-substituted amine. In one embodiment m₁ is 1.

In one embodiment, prodrugs having the dipeptide prodrug element linkedvia an aromatic amino acid and having a t_(1/2), e.g., of about 6 toabout 24 hours are provided wherein the dipeptide comprises a structureof:

wherein

R₁ is selected from the group consisting of hydrogen, C₁-C₁₈ alkyl,(C₁-C₁₈ alkyl)OH, (C₁-C₄ alkyl)NH₂, and (C₀-C₄ alkyl)(C₆-C₁₀ aryl)R₇;

R₃ is C₁-C₁₈ alkyl or R₃ and R₄ together with the atoms to which theyare attached form a 4-6 heterocyclic ring;

R₄ and R₈ are independently selected from the group consisting ofhydrogen, C₁-C₁₈ alkyl and (C₀-C₄ alkyl)(C₆-C₁₀ aryl)R₇;

R₅ is NHR₆;

R₆ is H, C₁-C₈ alkyl, or R₆ and R₁ together with the atoms to which theyare attached form a 4, 5 or 6 member heterocyclic ring; and

R₇ is selected from the group consisting of hydrogen, C₁-C₁₈ alkyl,C₂-C₁₈ alkenyl, (C₀-C₄ alkyl)CONH₂, (C₀-C₄ alkyl)COOH, (C₀-C₄ alkyl)NH₂,(C₀-C₄ alkyl)OH, and halo.

In another embodiment a single chain insulin analog prodrug derivativecomprising the structure of Formula IV, wherein m₁ is an integer from 0to 3 and having a t½ of about 72 to about 168 hours in PBS underphysiological conditions, is provided. In one embodiment where thesingle chain insulin analog prodrug derivative having a t½ of about 72to about 168 hours in PBS under physiological conditions comprises thestructure of formula IV wherein,

R₁ and R₂ are independently selected from the group consisting ofhydrogen, C₁-C₈ alkyl and aryl;

R₃ is C₁-C₁₈ alkyl or R₃ and R₄ together with the atoms to which theyare attached form a 4-6 heterocyclic ring;

R₄ and R₈ are each hydrogen; and

R₅ is selected from the group consisting of amine, N-substituted amineand hydroxyl. In one embodiment m₁ is 1.

In one embodiment, prodrugs having the dipeptide prodrug element linkedvia an aromatic amino acid and having a t_(1/2), e.g., of about 72 toabout 168 hours are provided wherein the dipeptide comprises a structureof:

wherein R₁ and R₂ are independently selected from the group consistingof hydrogen, C₁-C₈ alkyl, (C₁-C₄ alkyl)COOH, and (C₀-C₄ alkyl)(C₆-C₁₀aryl)R₇, or R₁ and R₅ together with the atoms to which they are attachedform a 4-11 heterocyclic ring;

R₃ is C₁-C₁₈ alkyl or R₃ and R₄ together with the atoms to which theyare attached form a 4-6 heterocyclic ring;

R₄ is hydrogen or forms a 4-6 heterocyclic ring with R₃;

R₈ is hydrogen;

R₅ is NHR₆ or OH;

R₆ is H or C₁-C₈ alkyl, or R₆ and R₁ together with the atoms to whichthey are attached form a 4, 5 or 6 member heterocyclic ring; and

R₇ is selected from the group consisting of hydrogen, C₁-C₁₈ alkyl,C₂-C_(1s) alkenyl, (C₀-C₄ alkyl)CONH₂, (C₀-C₄ alkyl)COOH, (C₀-C₄alkyl)NH₂, (C₀-C₄ alkyl)OH, and halo.

In accordance with one embodiment the dipeptide of Formula X is furthermodified to comprise a large polymer that interferes with the singlechain insulin analog's ability to interact with the insulin or IGF-1receptor. Subsequent cleavage of the dipeptide releases the single chaininsulin analog from the dipeptide complex wherein the released singlechain insulin analog is fully active. In accordance with one embodimentthe dipeptide of Formula X is further modified to comprises a largepolymer that interferes with the bound single chain insulin analog'sability to interact with the insulin or IGF-1 receptor. In accordancewith one embodiment the single chain insulin analog comprises adipeptide of the general structure of Formula X:

wherein one of the amino acid side chains of the dipeptide of Formula Xis pegylated or acylated.

In one embodiment a single chain insulin analog is provided thatcomprises the structure IB-LM-IA, wherein IB comprises sequence

J-R₂₃—R₂₂—X₂₅LCGX₂₉X₃₀LVX₃₃X₃₄LX₃₆LVCGX₄₁X₄₂GFX₄₅ (SEQ ID NO: 20);

LM comprises a linking moiety as described herein; and

IA comprises the sequence GIVX₄X₅CCX₈X₉X₁₀CX₁₂LX₁₄X₁₅LEX₁₈X₁₉CX₂₁—R₁₃(SEQ ID NO: 152), wherein

J is H or a dipeptide element of formula X;

X₄ is glutamic acid or aspartic acid;

X₅ is glutamine or glutamic acid

X₈ is histidine, threonine or phenylalanine;

X₉ is serine, arginine, lysine, ornithine or alanine;

X₁₀ is isoleucine or serine;

X₁₂ is serine or aspartic acid

X₁₄ is tyrosine, arginine, lysine, ornithine or alanine;

X₁₅ is glutamine, glutamic acid, arginine, alanine, lysine, ornithine orleucine;

X₁₈ is methionine, asparagine, glutamine, aspartic acid, glutamic acidor threonine;

X₁₉ is an amino acid of the general structure:

-   -   wherein X is selected from the group consisting of OH or NHR₁₀,        wherein R₁₀ is H or a dipeptide element comprising the general        structure of Formula X;

X₂₁ is selected from the group consisting of alanine, glycine, serine,valine, threonine, isoleucine, leucine, glutamine, glutamic acid,asparagine, aspartic acid, histidine, tryptophan, tyrosine, andmethionine;

X₂₅ is histidine or threonine;

X₂₉ is selected from the group consisting of alanine, glycine andserine;

X₃₀ is selected from the group consisting of histidine, aspartic acid,glutamic acid, homocysteic acid and cysteic acid;

X₃₃ is selected from the group consisting of aspartic acid, glutamineand glutamic acid;

X₃₄ is selected from the group consisting of alanine and threonine;

X₄₁ is selected from the group consisting of glutamic acid, asparticacid or asparagine;

X₄₂ is selected from the group consisting of alanine, lysine, ornithineand arginine;

X₄₅ is an amino acid of the general structure

-   -   wherein X₁₃ is selected from the group consisting of H, OH and        NHR₁₂, wherein R₁₂ is H or dipeptide element comprising the        general structure of Formula X;

R₂₂ is selected from the group consisting of AYRPSE (SEQ ID NO: 14),FVNQ (SEQ ID NO: 12), PGPE (SEQ ID NO: 11), a tripeptideglycine-proline-glutamic acid, a tripeptide valine-asparagine-glutamine,a dipeptide proline-glutamic acid, a dipeptide asparagine-glutamine,glutamine, glutamic acid and an N-terminal amine;

R₂₃ is a bond or G(X₆₀)_(d)(X₆₁)_(g)K (SEQ ID NO: 191)

-   -   wherein X₆₀, X₆₁ are independently glutamic acid or aspartic        acid; and    -   d and g are integers independently ranging from 1-6; and

R₁₃ is COOH or CONH, further wherein the dipeptide of Formula X isacylated or pegylated. In one embodiment J comprises an acylated orpegylated dipeptide of Formula X.

The single chain insulin analogs and prodrug derivative thereofdisclosed herein can be further modified to improve the peptide'ssolubility in aqueous solutions at physiological pH, while enhancing theeffective duration of the peptide by preventing renal clearance of thepeptide. Peptides are easily cleared because of their relatively smallmolecular size when compared to plasma proteins. Increasing themolecular weight of a peptide above 40 kDa exceeds the renal thresholdand significantly extends duration in the plasma. Accordingly, in oneembodiment the peptide prodrugs are further modified to comprise acovalently linked hydrophilic moiety.

In one embodiment the hydrophilic moiety is a plasma protein,polyethylene glycol chain or the Fc portion of an immunoglobin.Therefore, in one embodiment the presently disclosed insulin analogs arefurther modified to comprise one or more hydrophilic groups covalentlylinked to the side chains of amino acids.

In accordance with one embodiment the insulin prodrugs disclosed hereinare further modified by linking a hydrophilic moiety to either theN-terminal amino acid of the B chain or to the side chain of a lysineamino acid (or other suitable amino acid) located at the carboxyterminus of the B chain, including for example, at position 28 of SEQ IDNO: 89. In one embodiment a single-chain insulin prodrug derivative isprovided wherein one of the amino acids of the linking moiety ismodified by linking a hydrophilic moiety to the side chain of thepeptide linker. In one embodiment the modified amino acid is cysteine,lysine or acetyl phenylalanine.

In accordance with one embodiment a prodrug derivative of the singlechain insulin analog is provided wherein the dipeptide element ofFormula X further comprises an polyethylene glycol, alkyl or acyl group.In one embodiment one or more polyethylene glycol chains are linked tothe dipeptide of Formula X wherein the combined molecular weight of thepolyethylene glycol chains ranges from about 20,000 to about 80,000Daltons, or 40,000 to 80,000 Daltons or 40,000 to 60,000 Daltons. In oneembodiment at least one polyethylene glycol chain having a molecularweight of about 40,000 Daltons is linked to the dipeptide of Formula X.In another embodiment the dipeptide of Formula X is acylated with anacyl group of sufficient size to bind serum albumin and thus inactivatethe IGF^(B16B17) analog peptide upon administration. The acyl group canbe linear or branched, and in one embodiment is a C16 to C30 fatty acid.For example, the acyl group can be any of a C16 fatty acid, C18 fattyacid, C20 fatty acid, C22 fatty acid, C24 fatty acid, C26 fatty acid,C28 fatty acid, or a C30 fatty acid. In some embodiments, the acyl groupis a C16 to C20 fatty acid, e.g., a C18 fatty acid or a C20 fatty acid.

In another embodiment the single chain insulin analog peptides, andtheir prodrug analogs, disclosed herein are further modified by theaddition of a modified amino acid to the carboxy or amino terminus ofthe A chain or the amino terminus of the B chain of the single chaininsulin analog peptide, wherein the added amino acid is modified tocomprise a hydrophilic moiety linked to the amino acid. In oneembodiment the amino acid added to the C-terminus is a modifiedcysteine, lysine or acetyl phenylalanine. In one embodiment thehydrophilic moiety is selected from the group consisting of a plasmaprotein, polyethylene glycol chain and an Fc portion of an immunoglobin.

In one embodiment the hydrophilic group is a polyethylene glycol chain,and in one embodiment two or more polyethylene glycol chains arecovalently attached to two or more amino acid side chains of the singlechain insulin analog. In accordance with one embodiment the hydrophilicmoiety is covalently attached to an amino acid side chain of a singlechain insulin analog disclosed herein at a position corresponding toA10, B28, B29, the C-terminus of the A chain or the N-terminus of the Bchain (positions relative to native insulin). For single chain insulinanalogs and their prodrug derivatives having multiple polyethyleneglycol chains, the polyethylene glycol chains can be attached at theN-terminal amino acid of the B chain or to the side chain of a lysineamino acid located at the carboxy terminus of the B chain, or by theaddition of a single amino acid at the C-terminus of the peptide whereinthe added amino acid has a polyethylene glycol chain linked to its sidechain. In accordance with one embodiment a prodrug derivative isprovided wherein the polyethylene glycol chain or other hydrophilicmoiety is linked to the side chain of one of the two amino acidscomprising the dipeptide prodrug element. In one embodiment thedipeptide prodrug element comprises a lysine (in the D or L stereoisomerconfiguration) with a polyethylene glycol chain attached to the sidechain amine of the lysine.

In accordance with one embodiment, the single chain insulin analogpeptides, or prodrug derivatives thereof, disclosed herein are furthermodified by amino acid substitutions, wherein the substituting aminoacid comprises a side chain suitable for crosslinking with hydrophilicmoieties, including for example, polyethylene glycol. For example, inone embodiment a native amino acid at a position corresponding to A5,A8, A9, A10, A12, A14, A15, A17, A18, B1, B2, B3, B4, B5, B13, B14, B17,B21, B22, B26, B27, B28, B29 and B30 of native insulin is substitutedwith a lysine, cysteine or acetyl phenylalanine residue (or a lysine,cysteine or acetyl phenylalanine residue is added to the C-terminus) toallow for the covalent attachment of a polyethylene glycol chain.

In one embodiment the single chain insulin analog, or prodrug derivativethereof, has a single cysteine substitution or a single cysteine residueadded to the amino or carboxy terminus of the single chain insulinanalog, or an amino acid within the linking moiety or the dipeptideelement of an insulin prodrug derivative is substituted with at leastone cysteine residue, wherein the side chain of the cysteine residue isfurther modified with a thiol reactive reagent, including for example,maleimido, vinyl sulfone, 2-pyridylthio, haloalkyl, and haloacyl. Thesethiol reactive reagents may contain carboxy, keto, hydroxyl, and ethergroups as well as other hydrophilic moieties such as polyethylene glycolunits. In an alternative embodiment, the single chain insulin analog, orprodrug derivative thereof, has a single lysine substitution or a singlelysine residue added to the amino or carboxy terminus of the singlechain insulin analog, or an amino acid within the linking moiety or thedipeptide element of an insulin prodrug derivative is substituted withlysine, and the side chain of the substituting lysine residue is furthermodified using amine reactive reagents such as active esters(succinimido, anhydride, etc) of carboxylic acids or aldehydes ofhydrophilic moieties such as polyethylene glycol.

In accordance with one embodiment a pharmaceutical composition isprovided comprising any of the novel single chain insulin analogsdisclosed herein, preferably at a purity level of at least 90%, 91%,92%, 93%, 94%, 95%, 96%, 97%, 98% or 99%, and a pharmaceuticallyacceptable diluent, carrier or excipient. Such compositions may containa single chain insulin analog as disclosed herein at a concentration ofat least 0.5 mg/ml, 1 mg/ml, 2 mg/ml, 3 mg/ml, 4 mg/ml, 5 mg/ml, 6mg/ml, 7 mg/ml, 8 mg/ml, 9 mg/ml, 10 mg/ml, 11 mg/ml, 12 mg/ml, 13mg/ml, 14 mg/ml, 15 mg/ml, 16 mg/ml, 17 mg/ml, 18 mg/ml, 19 mg/ml, 20mg/ml, 21 mg/ml, 22 mg/ml, 23 mg/ml, 24 mg/ml, 25 mg/ml or higher. Inone embodiment the pharmaceutical compositions comprise aqueoussolutions that are sterilized and optionally stored contained withinvarious package containers. In other embodiments the pharmaceuticalcompositions comprise a lyophilized powder. The pharmaceuticalcompositions can be further packaged as part of a kit that includes adisposable device for administering the composition to a patient. Thecontainers or kits may be labeled for storage at ambient roomtemperature or at refrigerated temperature.

In one embodiment, a composition is provided comprising a mixture of afirst and second single chain insulin analog prodrug derivative, whereinthe first and second single chain insulin analog prodrug derivativesdiffer from one another based on the structure of the prodrug element.More particularly, the first single chain insulin analog prodrugderivative may comprise a dipeptide prodrug element that has a half lifesubstantially different from the dipeptide prodrug element of the secondsingle chain insulin analog prodrug derivative. Accordingly, selectionof different combinations of substituents on the dipeptide element willallow for the preparation of compositions that comprise a mixture ofsingle chain insulin analog prodrug derivatives that are activated in acontrolled manner over a desired time frame and at specific timeintervals. For example, the compositions can be formulated to releaseactive single chain insulin analog peptide at mealtimes followed by asubsequent activation single chain insulin analog peptide duringnighttime with suitable dosages being released based on time ofactivation.

In another embodiment the pharmaceutical composition comprises a mixtureof a single chain insulin analog prodrug derivative disclosed herein andnative insulin, or a known bioactive analog of insulin. The mixture inone embodiment can be in the form of a heteroduplex linking a singlechain insulin analog and a native insulin, or a known bioactive analogof insulin. The dimers may comprise a single chain insulin analogpeptide linked to another single chain insulin analog or to a disulfidelinked A chain to B chain insulin heteroduplex. The mixtures maycomprise one or more of the single chain insulin analogs, nativeinsulin, or a known bioactive analog of insulin, in prodrug derivativesthereof or depot derivative thereof or other conjugate forms, and anycombination thereof, as disclosed herein.

The disclosed single chain insulin analogs, and their correspondingprodrug derivatives, are believed to be suitable for any use that haspreviously been described for insulin peptides. Accordingly, the singlechain insulin analogs, and their corresponding prodrug derivatives,described herein can be used to treat hyperglycemia, or treat othermetabolic diseases that result from high blood glucose levels.Accordingly, the present invention encompasses pharmaceuticalcompositions comprising a single chain insulin analog as disclosedherein, or a prodrug derivative thereof, and a pharmaceuticallyacceptable carrier for use in treating a patient suffering from highblood glucose levels. In accordance with one embodiment the patient tobe treated using a single chain insulin analog disclosed herein is adomesticated animal, and in another embodiment the patient to be treatedis a human.

One method of treating hyperglycemia in accordance with the presentdisclosure comprises the steps of administering the presently disclosedsingle chain insulin analog, or depot or prodrug derivative thereof, toa patient using any standard route of administration, includingparenterally, such as intravenously, intraperitoneally, subcutaneouslyor intramuscularly, intrathecally, transdermally, rectally, orally,nasally or by inhalation. In one embodiment the composition isadministered subcutaneously or intramuscularly. In one embodiment, thecomposition is administered parenterally and the single chain insulinanalog, or prodrug derivative thereof, is prepackaged in a syringe.

The single chain insulin analog disclosed herein, and depot or prodrugderivative thereof, may be administered alone or in combination withother anti-diabetic agents. Anti-diabetic agents known in the art orunder investigation include native insulin, native glucagon andfunctional analogs thereof, sulfonylureas, such as tolbutamide(Orinase), acetohexamide (Dymelor), tolazamide (Tolinase),chlorpropamide (Diabinese), glipizide (Glucotrol), glyburide (Diabeta,Micronase, Glynase), glimepiride (Amaryl), or gliclazide (Diamicron);meglitinides, such as repaglinide (Prandin) or nateglinide (Starlix);biguanides such as metformin (Glucophage) or phenformin;thiazolidinediones such as rosiglitazone (Avandia), pioglitazone(Actos), or troglitazone (Rezulin), or other PPARy inhibitors; alphaglucosidase inhibitors that inhibit carbohydrate digestion, such asmiglitol (Glyset), acarbose (Precose/Glucobay); exenatide (Byetta) orpramlintide; Dipeptidyl peptidase-4 (DPP-4) inhibitors such asvildagliptin or sitagliptin; SGLT (sodium-dependent glucosetransporter 1) inhibitors; or FBPase (fructose 1,6-bisphosphatase)inhibitors.

Pharmaceutical compositions comprising the single chain insulin analogsdisclosed herein, or depot or prodrug derivatives thereof, can beformulated and administered to patients using standard pharmaceuticallyacceptable carriers and routes of administration known to those skilledin the art. Accordingly, the present disclosure also encompassespharmaceutical compositions comprising one or more of the single chaininsulin analogs disclosed herein (or prodrug derivative thereof), or apharmaceutically acceptable salt thereof, in combination with apharmaceutically acceptable carrier. In one embodiment thepharmaceutical composition comprises a 1 mg/ml concentration of thesingle chain insulin analog at a pH of about 4.0 to about 7.0 in aphosphate buffer system. The pharmaceutical compositions may comprisethe single chain insulin analog as the sole pharmaceutically activecomponent, or the single chain insulin analog peptide can be combinedwith one or more additional active agents.

All therapeutic methods, pharmaceutical compositions, kits and othersimilar embodiments described herein contemplate that single chaininsulin analog peptides, or prodrug derivatives thereof, include allpharmaceutically acceptable salts thereof.

In one embodiment the kit is provided with a device for administeringthe single chain insulin analog composition to a patient. The kit mayfurther include a variety of containers, e.g., vials, tubes, bottles,and the like. Preferably, the kits will also include instructions foruse. In accordance with one embodiment the device of the kit is anaerosol dispensing device, wherein the composition is prepackaged withinthe aerosol device. In another embodiment the kit comprises a syringeand a needle, and in one embodiment the single chain insulin analogcomposition is prepackaged within the syringe.

The compounds of this invention may be prepared by standard syntheticmethods, recombinant DNA techniques, or any other methods of preparingpeptides and fusion proteins. Although certain non-natural amino acidscannot be expressed by standard recombinant DNA techniques, techniquesfor their preparation are known in the art. Compounds of this inventionthat encompass non-peptide portions may be synthesized by standardorganic chemistry reactions, in addition to standard peptide chemistryreactions when applicable.

Example 1 Synthesis of Insulin A & B Chains

Insulin A & B chains were synthesized on 4-methylbenzhyryl amine (MBHA)resin or 4-Hydroxymethyl-phenylacetamidomethyl (PAM) resin using Bocchemistry. The peptides were cleaved from the resin using HF/p-cresol95:5 for 1 hour at 0° C. Following HF removal and ether precipitation,peptides were dissolved into 50% aqueous acetic acid and lyophilized.Alternatively, peptides were synthesized using Fmoc chemistry. Thepeptides were cleaved from the resin using Trifluoroacetic acid(TFA)/Triisopropylsilane (TIS)/H₂O (95:2.5:2.5), for 2 hour at roomtemperature. The peptide was precipitated through the addition of anexcessive amount of diethyl ether and the pellet solubilized in aqueousacidic buffer. The quality of peptides were monitored by RP-HPLC andconfirmed by Mass Spectrometry (ESI or MALDI).

Insulin A chains were synthesized with a single free cysteine at aminoacid 7 and all other cysteines protected as acetamidomethylA-(SH)⁷(Acm)^(6,11,20). Insulin B chains were synthesized with a singlefree cysteine at position 7 and the other cysteine protected asacetamidomethyl B-(SH)⁷(Acm)¹⁹. The crude peptides were purified byconventional RP-HPLC.

The synthesized A and B chains were linked to one another through theirnative disulfide bond linkage in accordance with the general procedureoutlined in FIG. 1. The respective B chain was activated to theCys⁷-Npys analog through dissolution in DMF or DMSO and reacted with2,2′-Dithiobis(5-nitropyridine) (Npys) at a 1:1 molar ratio, at roomtemperature. The activation was monitored by RP-HPLC and the product wasconfirmed by ESI-MS.

The first B7-A7 disulfide bond was formed by dissolution of therespective A-(SH)⁷(Acm)^(6,11,20) and B-(Npys)⁷(Acm)¹⁹ at 1:1 molarratio to a total peptide concentration of 10 mg/ml. When the chaincombination reaction was complete the mixture was diluted to aconcentration of 50% aqueous acetic acid. The last two disulfide bondswere formed simultaneously through the addition of iodine. A 40 foldmolar excess of iodine was added to the solution and the mixture wasstirred at room temperature for an additional hour. The reaction wasterminated by the addition of an aqueous ascorbic acid solution. Themixture was purified by RP-HPLC and the final compound was confirmed byMALDI-MS. As shown in FIG. 2 and the data in Table 1, the syntheticinsulin prepared in accordance with this procedure compares well withpurified insulin for insulin receptor binding.

Insulin peptides comprising a modified amino acid (such as 4-aminophenylalanine at position A19) can also be synthesized in vivo using asystem that allows for incorporation of non-coded amino acids intoproteins, including for example, the system taught in U.S. Pat. Nos.7,045,337 and 7,083,970.

TABLE 1 Activity of synthesized insulin relative to native insulinInsulin Standard A7-B7 Insulin AVER. STDEV AVER. STDEV IC₅₀(nM) 0.240.07 0.13 0.08 % of Insulin Activity 100 176.9

Example 2 Pegylation of Amine Groups (N-Terminus and Lysine) byReductive Alkylation

a. Synthesis

Insulin (or an insulin analog), mPEG20k-Aldyhyde, and NaBH₃CN, in amolar ratio of 1:2:30, were dissolved in acetic acid buffer at a pH of4.1-4.4. The reaction solution was composed of 0.1 N NaCl, 0.2 N aceticacid and 0.1 N Na₂CO₃. The insulin peptide concentration wasapproximately 0.5 mg/ml. The reaction occurs over six hours at roomtemperature. The degree of reaction was monitored by RP-HPLC and theyield of the reaction was approximately 50%.

b. Purification

The reaction mixture was diluted 2-5 fold with 0.1% TFA and applied to apreparative RP-HPLC column. HPLC condition: C4 column; flow rate 10ml/min; A buffer 10% ACN and 0.1% TFA in water; B buffer 0.1% TFA inACN; A linear gradient B % from 0-40% (0-80 min); PEG-insulin oranalogues was eluted at approximately 35% buffer B. The desiredcompounds were verified by MALDI-TOF, following chemical modificationthrough sulftolysis or trypsin degradation.

Pegylation of Amine Groups (N-Terminus and Lysine) byN-Hydroxysuccinimide Acylation

a. Synthesis

Insulin (or an insulin analog) along with mPEG20k-NHS were dissolved in0.1 N Bicine buffer (pH 8.0) at a molar ratio of 1:1. The insulinpeptide concentration was approximately 0.5 mg/ml. Reaction progress wasmonitored by HPLC. The yield of the reaction is approximately 90% after2 hours at room temperature.

b. Purification

The reaction mixture was diluted 2-5 fold and loaded to RP-HPLC.

HPLC condition: C4 column; flow rate 10 ml/min; A buffer 10% ACN and0.1% TFA in water; B buffer 0.1% TFA in ACN; A linear gradient B % from0-40% (0-80 min); PEG-insulin or analogues was collected atapproximately 35% B. The desired compounds were verified by MALDI-TOF,following chemical modification through sulftolysis or trypsindegradation.

Reductive Aminated Pegylation of Acetyl Group on the Aromatic Ring ofthe Phenylalanine

a. Synthesis

Insulin (or an insulin analogue), mPEG20k-Hydrazide, and NaBH₃CN in amolar ratio of 1:2:20 were dissolved in acetic acid buffer (pH of 4.1 to4.4). The reaction solution was composed of 0.1 N NaCl, 0.2 N aceticacid and 0.1 N Na₂CO₃. Insulin or insulin analogue concentration wasapproximately 0.5 mg/ml. at room temperature for 24 h. The reactionprocess was monitored by HPLC. The conversion of the reaction wasapproximately 50%. (calculated by HPLC)

b. Purification

The reaction mixture was diluted 2-5 fold and loaded to RP-HPLC.

HPLC condition: C4 column; flow rate 10 ml/min; A buffer 10% ACN and0.1% TFA in water; B buffer 0.1% TFA in ACN; A linear gradient B % from0-40% (0-80 min); PEG-insulin, or the PEG-insulin analogue was collectedat approximately 35% B. The desired compounds were verified byMALDI-TOF, following chemical modification through sulftolysis ortrypsin degradation.

Example 3 Insulin Receptor Binding Assay

The affinity of each peptide for the insulin or IGF-1 receptor wasmeasured in a competition binding assay utilizing scintillationproximity technology. Serial 3-fold dilutions of the peptides were madein Tris-Cl buffer (0.05 M Tris-HCl, pH 7.5, 0.15 M NaCl, 0.1% w/v bovineserum albumin) and mixed in 96 well plates (Corning Inc., Acton, Mass.)with 0.05 nM (3-[125I]-iodotyrosyl) A TyrA14 insulin or(3-[125I]-iodotyrosyl) IGF-1 (Amersham Biosciences, Piscataway, N.J.).An aliquot of 1-6 micrograms of plasma membrane fragments prepared fromcells over-expressing the human insulin or IGF-1 receptors were presentin each well and 0.25 mg/well polyethylene imine-treated wheat germagglutinin type A scintillation proximity assay beads (AmershamBiosciences, Piscataway, N.J.) were added. After five minutes of shakingat 800 rpm the plate was incubated for 12 h at room temperature andradioactivity was measured with MicroBeta1450 liquid scintillationcounter (Perkin-Elmer, Wellesley, Mass.). Non-specifically bound (NSB)radioactivity was measured in the wells with a four-fold concentrationexcess of “cold” native ligand than the highest concentration in testsamples. Total bound radioactivity was detected in the wells with nocompetitor. Percent specific binding was calculated as following: %Specific Binding=(Bound-NSB/Total bound-NSB)×100. IC50 values weredetermined by using Origin software (OriginLab, Northampton, Mass.).

Example 4 Insulin Receptor Phosphorylation Assay

To measure receptor phosphorylation of insulin or insulin analog,receptor transfected HEK293 cells were plated in 96 well tissue cultureplates (Costar #3596, Cambridge, Mass.) and cultured in Dulbecco'smodified Eagle medium (DMEM) supplemented with 100 IU/ml penicillin, 100μg/ml streptomycin, 10 mM HEPES and 0.25% bovine growth serum (HyCloneSH30541, Logan, Utah) for 16-20 hrs at 37° C., 5% CO₂ and 90% humidity.Serial dilutions of insulin or insulin analogs were prepared in DMEMsupplemented with 0.5% bovine serum albumin (Roche Applied Science#100350, Indianapolis, Ind.) and added to the wells with adhered cells.After 15 min incubation at 37° C. in humidified atmosphere with 5% CO₂the cells were fixed with 5% paraformaldehyde for 20 min at roomtemperature, washed twice with phosphate buffered saline pH 7.4 andblocked with 2% bovine serum albumin in PBS for 1 hr. The plate was thenwashed three times and filled with horseradish peroxidase-conjugatedantibody against phosphotyrosine (Upstate biotechnology #16-105,Temecula, Calif.) reconstituted in PBS with 2% bovine serum albumin permanufacturer's recommendation. After 3 hrs incubation at roomtemperature the plate was washed 4 times and 0.1 ml of TMB singlesolution substrate (Invitrogen, #00-2023, Carlbad, Calif.) was added toeach well. Color development was stopped 5 min later by adding 0.05 ml 1N HCl. Absorbance at 450 nm was measured on Titertek Multiscan MCC340(ThermoFisher, Pittsburgh, Pa.). Absorbance vs. peptide concentrationdose response curves were plotted and EC₅₀ values were determined byusing Origin software (OriginLab, Northampton, Mass.).

Example 5 Determination of Rate of Model Dipeptide Cleavage (in PBS)

A specific hexapeptide (HSRGTF-NH₂; SEQ ID NO: 156) was used as a modelpeptide upon which the rate of cleavage of dipeptide N-terminalextensions could be studied. The dipeptide-extended model peptides wereprepared Boc-protected sarcosine and lysine were successively added tothe model peptide-bound resin to produce peptide A (Lys-Sar-HSRGTF-NH₂;SEQ ID NO: 157). Peptide A was cleaved by HF and purified by preparativeHPLC. Preparative purification using HPLC:

Purification was performed using HPLC analysis on a silica based 1×25 cmVydac C18 (5μ particle size, 300 A° pore size) column. The instrumentsused were: Waters Associates model 600 pump, Injector model 717, and UVdetector model 486. A wavelength of 230 nm was used for all samples.Solvent A contained 10% CH₃CN/0.1% TFA in distilled water, and solvent Bcontained 0.1% TFA in CH₃CN. A linear gradient was employed (0 to 100% Bin 2 hours). The flow rate was 10 ml/min and the fraction size was 4 ml.From ˜150 mgs of crude peptide, 30 mgs of the pure peptide was obtained.

Peptide A was dissolved at a concentration of 1 mg/ml in PBS buffer. Thesolution was incubated at 37° C. Samples were collected for analysis at5 h, 8 h, 24 h, 31 h, and 47 h. The dipeptide cleavage was quenched bylowering the pH with an equal volume of 0.1% TFA. The rate of cleavagewas qualitatively monitored by LC-MS and quantitatively studied by HPLC.The retention time and relative peak area for the prodrug and the parentmodel peptide were quantified using Peak Simple Chromatography software.

Analysis Using Mass Spectrometry

The mass spectra were obtained using a Sciex API-III electrosprayquadrapole mass spectrometer with a standard ESI ion source. Ionizationconditions that were used are as follows: ESI in the positive-ion mode;ion spray voltage, 3.9 kV; orifice potential, 60 V. The nebulizing andcurtain gas used was nitrogen flow rate of 0.9 L/min. Mass spectra wererecorded from 600-1800 Thompsons at 0.5 Th per step and 2 msec dwelltime. The sample (about 1 mg/mL) was dissolved in 50% aqueousacetonitrile with 1% acetic acid and introduced by an external syringepump at the rate of 5 μL/min. Peptides solubilized in PBS were desaltedusing a ZipTip solid phase extraction tip containing 0.6 μL C4 resin,according to instructions provided by the manufacturer (MilliporeCorporation, Billerica, Mass.) prior to analysis.

Analysis Using HPLC

The HPLC analyses were performed using a Beckman System GoldChromatography system equipped with a UV detector at 214 nm and a 150mm×4.6 mm C8 Vydac column. The flow rate was 1 ml/min. Solvent Acontained 0.1% TFA in distilled water, and solvent B contained 0.1% TFAin 90% CH₃CN. A linear gradient was employed (0% to 30% B in 10minutes). The data were collected and analyzed using Peak SimpleChromatography software.

The rate of cleavage was determined for the respective propeptides. Theconcentrations of the propeptides and the model parent peptide weredetermined by their respective peak areas. The first order dissociationrate constants of the prodrugs were determined by plotting the logarithmof the concentration of the prodrug at various time intervals. The slopeof this plot provides the rate constant ‘k’. The half lives for cleavageof the various prodrugs were calculated by using the formulat_(1/2)=0.693/k. The half life of the Lys-Sar extension to this modelpeptide HSRGTF-NH₂ (SEQ ID NO: 156) was determined to be 14.0 h.

Example 6 Rate of Dipeptide Cleavage Half Time in Plasma as Determinedwith an all d-Isoform Model Peptide

An additional model hexapeptide (dHdTdRGdTdF-NH₂ SEQ ID NO: 158) wasused to determine the rate of dipeptide cleavage in plasma. The d-isomerof each amino acid was used to prevent enzymatic cleavage of the modelpeptide, with the exception of the prodrug extension. This modeld-isomer hexapeptide was synthesized in an analogous fashion to the1-isomer. The sarcosine and lysine were successively added to theN-terminus as reported previously for peptide A to prepare peptide B(dLys-dSar-dHdTdRGdTdF-NH₂ SEQ ID NO: 159)

The rate of cleavage was determined for the respective propeptides. Theconcentrations of the propeptides and the model parent peptide weredetermined by their respective peak areas. The first order dissociationrate constants of the prodrugs were determined by plotting the logarithmof the concentration of the prodrug at various time intervals. The slopeof this plot provides the rate constant ‘k’. The half life of theLys-Sar extension to this model peptide dHdTdRGdTdF-NH₂ (SEQ ID NO: 158)was determined to be 18.6 h.

Example 7

The rate of cleavage for additional dipeptides linked to the modelhexapeptide (HSRGTF-NH₂; SEQ ID NO: 156) were determined using theprocedures described in Example 5. The results generated in theseexperiments are presented in Tables 2 and 3.

TABLE 2: Cleavage of the Dipeptide U—B that are linked to the side chainof an N-terminal para- amino-Phe from the Model Hexapeptide (HSRGTF—NH₂;SEQ ID NO: 156) in PBS

Compounds U (amino acid) O (amino acid) t_(1/2)  1 F P 58 h  2Hydroxyl-F P 327 h  3 d-F P 20 h  4 d-F d-P 39 h  5 G P 72 h  6Hydroxyl-G P 603 h  7 L P 62 h  8 tert-L P 200 h  9 S P 34 h 10 P P 97 h11 K P 33 h 12 dK P 11 h 13 E P 85 h 14 Sar P ≈1000 h 15 Aib P 69 min 16Hydroxyl-Aib P 33 h 17 cyclohexane P 6 min 18 G G No cleavage 19Hydroxyl-G G No cleavage 20 S N-Methyl-Gly 4.3 h 21 K N-Methyl-Gly 5.2 h22 Aib N-Methyl-Gly 7.1 min 23 Hydroxyl-Aib N-Methyl-Gly 1.0 h

TABLE 3 Cleavage of the Dipeptides U-B linked to histidine (or histidineanalog) at position 1 (X) from the Model Hexapeptide (XSRGTF-NH₂; SEQ IDNO: 160) in PBS NH₂-U-B-XSRGTF-NH₂ (SEQ ID NO: 160) Comd. U (amino acid)O (amino acid) X (amino acid) t_(1/2) 1 F P H No cleavage 2 Hydroxyl-F PH No cleavage 3 G P H No cleavage 4 Hydroxyl-G P H No cleavage 5 A P HNo cleavage 6 C P H No cleavage 7 S P H No cleavage 8 P P H No cleavage9 K P H No cleavage 10 E P H No cleavage 11 Dehydro V P H No cleavage 12P d-P H No cleavage 13 d-P P H No cleavage 14 Aib P H 32 h 15 Aib d-P H20 h 16 Aib P d-H 16 h 17 Cyclohexyl- P H 5 h 18 Cyclopropyl- P H 10 h19 N—Me-Aib P H >500 h 20 α,α-diethyl- P H 46 h Gly 21 Hydroxyl-Aib P H61 22 Aib P A 58 23 Aib P N-Methyl-His 30 h 24 Aib N-Methyl-Gly H 49 min25 Aib N-Hexyl-Gly H 10 min 26 Aib Azetidine-2- H >500 h carboxylic acid27 G N-Methyl-Gly H 104 h 28 Hydroxyl-G N-Methyl-Gly H 149 h 29 GN-Hexyl-Gly H 70 h 30 dK N-Methyl-Gly H 27 h 31 dK N-Methyl-Ala H 14 h32 dK N-Methyl-Phe H 57 h 33 K N-Methyl-Gly H 14 h 34 F N-Methyl-Gly H29 h 35 S N-Methyl-Gly H 17 h 36 P N-Methyl-Gly H 181 h

Example 8 Identification of an Insulin Analog with Structure Suitablefor Prodrug Construction

Position 19 of the A chain is known to be an important site for insulinactivity. Modification at this site to allow the attachment of a prodrugelement is therefore desirable. Specific analogs of insulin at A19 havebeen synthesized and characterized for their activity at the insulinreceptors. Two highly active structural analogs have been identified atA19, wherein comparable structural changes at a second active sitearomatic residue (B24) were not successful in identification ofsimilarly full activity insulin analogs.

Tables 4 and 5 illustrate the high structural conservation at positionA19 for full activity at the insulin receptor (receptor bindingdetermined using the assay described in Example 3). Table 4 demonstratesthat only two insulin analogs with modifications at A19 have receptorbinding activities similar to native insulin. For the 4-amino insulinanalog, data from three separate experiments is provided. The columnlabeled “Activity (in test)” compares the percent binding of the insulinanalog relative to native insulin for two separate experiments conductedsimultaneously. The column labeled “Activity (0.60 nM)” is the relativepercent binding of the insulin analog relative to the historical averagevalue obtained for insulin binding using this assay. Under eitheranalysis, two A19 insulin analogs (4-amino phenylalanine and 4-methoxyphenylalanine) demonstrate receptor binding approximately equivalent tonative insulin. FIG. 3 represents a graph demonstrating the respectivespecific binding of native insulin and the A19 insulin analog to theinsulin receptor. Table 5 presents data showing that the two A19 insulinanalogs (4-amino and 4-methoxy) that demonstrate equivalent bindingactivities as native insulin also demonstrate equivalent activity at theinsulin receptor (receptor activity determined using the assay describedin Example 4).

TABLE 4 Insulin Receptor Binding Activity of A19 Insulin Analogs InsulinReceptor % native ligand % native ligand Activity Activity Analogue IC₅₀STDev (in test) (0.60 nM) 4-OH (native 0.64 0.15 100.0 100.0 insulin)4-COCH₃ 31.9 9.47 0.6 1.9 4-NH₂ 0.31 0.12 203.0 193.5 0.83 0.15 103.072.3 0.8 0.1 94.0 75.0 4-NO₂ 215.7 108.01 0.3 1.3 3,4,5-3F 123.29 31.100.5 0.5 4-OCH₃ 0.5 0.50 173.0 120.0 3-OCH₃ 4.74 1.09 28.0 12.7 5.16 3.8818.0 11.6 4-OH, 3,5-2Br 1807.17 849.72 0.0 0.0 4-OH, 3,5-2 NO₂ 2346.2338.93 0.0 0.0

TABLE 5 Insulin Receptor Phosphorylation Activity of A19 Insulin AnalogsInsulin Receptor % native ligand Analogue EC₅₀ STDev Activity (in test)4-OH (native insulin) 1.22 0.4 100.0 4-NH₂ 0.31 0.14 393.5 4-OCH₃ 0.940.34 129.8

Example 9 INSULIN LIKE GROWTH FACTOR (IGF) ANALOG IGF1 (Y^(B16)L^(B17))

Applicants have discovered an IGF analog that demonstrates similaractivity at the insulin receptor as native insulin. More particularly,the IGF analog (IGF1 (Y^(B16)L^(B17)) comprises the native IGF A chain(SEQ ID NO: 5) and the modified B chain (SEQ ID NO: 6), wherein thenative glutamine and phenylalanine at positions 15 and 16 of the nativeIGF B-chain (SEQ ID NO: 3) have been replaced with tyrosine and leucineresidues, respectively. As shown in FIG. 4 and Table 6 below the bindingactivities of IGF1 (Y^(B16)L^(B17)) and native insulin demonstrate thateach are highly potent agonists of the insulin receptor.

TABLE 6 Insulin Standard IGF1(Y^(B16)L^(B17)) AVER. STDEV AVER. STDEVIC₅₀(nM) 1.32 0.19 0.51 0.18 % of Insulin Activity 100 262

Example 10 IGF Prodrug Analogs

Based on the activity of the A19 insulin analog (see Example 5), asimilar modification was made to the IGF1 A:B(Y^(B16)L^(B17)) analog andits ability to bind and stimulate insulin receptor activity wasinvestigated. FIG. 6 provides the general synthetic scheme for preparingIGF1 A:B(Y^(B16)L^(B17)) wherein the native tyrosine is replace with a4-amino phenylalanine [IGF1 A:B(Y^(B16)L^(B17))(p-NH₂—F)^(A19)amide] aswell as the preparation of its dipeptide extended analog [IGF1A:B(Y^(B16)L^(B17))^(A19)-AibAla amide], wherein a dipeptide comprisingAib and Ala are linked to the peptide through an amide linkage to theA19 4-amino phenylalanine. As shown in FIG. 7 and Table 7, the IGFanalog, IGF1 (Y^(B16)L^(B17)) A(p-NH₂—F)¹⁹ specifically binds to theinsulin receptor wherein the dipeptide extended analog of that analogfails to specifically bind the insulin receptor. Note the dipeptideextension lacks the proper structure to allow for spontaneous cleavageof the dipeptide (absence of an N-alkylated amino acid at the secondposition of the dipeptide) and therefore there is no restoration ofinsulin receptor binding.

IGF A:B(Y^(B16)L^(B17)) insulin analog peptides comprising a modifiedamino acid (such as 4-amino phenylalanine at position A19) can also besynthesized in vivo using a system that allows for incorporation ofnon-coded amino acids into proteins, including for example, the systemtaught in U.S. Pat. Nos. 7,045,337 and 7,083,970.

TABLE 7 IGF1(Y^(B16)L^(B17)) IGF1(Y^(B16)L^(B17)) Insulin Standard(p-NH₂—F)^(A19)amide (AibAla)^(A19)amide AVER. STDEV AVER. STDEV. AVER.STDEV IC₅₀(nM) 0.24 0.07 1.08 .075 No Activity % of 100 22 InsulinActivity

A further prodrug analog of an IGF^(B16B17) analog peptide was preparedwherein the dipeptide prodrug element (alanine-proline) was linked viaan amide bond to the amino terminus of the A chain(IGF1(Y^(B16)L^(B17))(AlaPro)^(A-1,0)). As shown in Table 8, theIGF1(Y^(B16)L^(B17))(AlaPro)^(A-1,0) has reduced affinity for theinsulin receptor. Note, based on the data of Table 3, the dipeptideprodrug element lacks the proper structure to allow for spontaneouscleavage of the dipeptide prodrug element, and therefore the detectedinsulin receptor binding is not the result of cleavage of the prodrugelement.

TABLE 8 Insulin Standard IGF1(Y^(B16)L^(B17))(AlaPro)^(A-1, 0) AVER.STDEV AVER. STDEV. IC₅₀(nM) 0.72 0.09 1.93 .96 % of Insulin 100 37.12Activity

Example 11 Additional IGF Insulin Analogs

Further modifications of the IGF1 (Y^(B16)L^(B17)) peptide sequencereveal additional IGF insulin analogs that vary in their potency at theinsulin and IGF-1 receptor. Binding data is presented in Table 9 foreach of these analogs (using the assay of Example 3), wherein theposition of the modification is designated based on the correspondingposition in the native insulin peptide (DPI=des B26-30). For example, areference herein to “position B28” absent any further elaboration wouldmean the corresponding position B27 of the B chain of an insulin analogin which the first amino acid of SEQ ID NO: 2 has been deleted. Thus ageneric reference to “B(Y16)” refers to a substitution of a tyrosineresidue at position 15 of the B chain of the native IGF-1 sequence (SEQID NO: 3). Data regarding the relative receptor binding of insulin andIGF analogs is provided in Table 9, and data regarding IGF analogstimulated phosphorylation (using the assay of Example 4) is provided inTable 10.

TABLE 9 Receptor Binding Affinity of Insulin and IGF Analogues InsulinReceptor % native IGF-1 Receptor % insulin % activity nM insulinactivity IGF-1 (0.55 Analogue IC₅₀: STDev Date (in test) (0.6 nM) IC₅₀STDev Date (in test) nM) Ratio IGF-1 A: B 10.41 1.65 Sep. 4, 2007 5.85.8 IGF-1 A: B 0.66 0.36 May 22, 2007 58.7 90.9 7.85 1.98 Jun. 4, 20076.8 7.0 11.9 (E10Y16L17) 0.51 0.18 May 29, 2007 98.8 117.6 12.19 2.17Sep. 18, 2007 5.0 4.5 IGF-1 A: B 1.22 0.30 Mar. 20, 2008 36.5 50.0 17.502.25 Apr. 4, 2008 3.0 3.1 14.3 (E10Y16L17)- E31E32B-COOH IGF-1 A: B 0.260.02 Nov. 9, 2007 301.0 231.0 6.79 1.50 Apr. 4, 2008 7.7 8.1 (D10Y16L17)0.2 0.02 Dec. 4, 2007 380.1 300.0 DPI A-COOH 0.42 0.06 Jun. 5, 2008174.1 144.1 IGF-1 A: B 0.38 0.08 Aug. 10, 2007 51.1 157.9 22.89 5.26Sep. 18, 2007 3.3 2.4 60.2 (E10Y16L17) DPI IGF-1 A: B 0.16 0.07 Nov. 9,2007 479.0 4.66 0.77 Apr. 4, 2008 11.2 11.8 29.1 (H5D10Y16L17) DPI IGF-1A: B 0.25 0.04 Nov. 9, 2007 316.0 (H5D10Y16L17) (S═O) DPI IGF-1 A (H8 A9N21): 0.05 0.01 Dec. 4, 2007 1576.7 4.03 0.50 Apr. 4, 2008 12.9 13.680.6 B(H5D10Y16L17) 0.09 0.02 Dec. 14, 2007 1667.0 DPI A-COOH IGF-1 A(H8 A9 N21): 0.12 0.02 Dec. 14, 2007 1171.4 22.83 3.53 Apr. 4, 2008 2.32.4 190.3 B (H5D10Y16L17 A22) DPI A-COOH IGF-1 A (H8 A9 N21): 0.36 0.10Dec. 14, 2007 400.7 B(H5D10Y16L17A22) (S═O) DPI A-COOH IGF-1 A: IGF-1 B(1-8)- 1.59 0.62 May 22, 2007 19.1 37.7 131.30 58.05 Jun. 4, 2007 0.30.4 22.6 In (9-17)-IGF- 1 B (18-30) IGF-1 A: In (1-17)-IGF- 2.77 1.19May 22, 2007 14.0 21.7 62.50 30.28 Jun. 4, 2007 0.9 0.9 22.6 1 B (18-30)2.67 0.67 May 18, 2007 11.3 22.5 2.48 1.35 May 29, 2007 20.1 24.2 IGF-1A: In B (1-5)- 0.31 0.19 Aug. 20, 2007 62.4 193.5 27.54 6.57 Sep. 25,2007 3.6 2 88.8 IGF-1 B (YL) (6-30) IGF-2 native 13.33 1.85 Sep. 25,2007 7.5 4.5 IGF-2 AB IGF-2 AB (YL) 6.81 3.81 Oct. 10, 2007 8.4 8.8 InA: IGF-1 B (YL) 82.62 31.75 Sep. 4, 2007 0.9 0.7 107.24 65.38 Sep. 4,2007 0.7 0.6 In A-IGF-2 D: 0.53 0.11 Sep. 4, 2007 141.0 113.0 1.59 0.34Sep. 18, 2007 47.6 34.6 In B-IGF-2 C 0.37 0.05 Oct. 13, 2007 179.1 162.214.69 3.02 Sep. 25, 2007 6.8 3.7 39.7 **All C terminals are amides (DPI)unless specified otherwise

TABLE 10 Total Phosphorylation by IGF-1 & IGF-2 Analogues InsulinReceptor IGF-1 Receptor Selective Analogue EC50: STDev Date % InsulinEC50: STDev Date % IGF Ratio Insulin 1.26 0.098 Dec. 14, 2007 114.8846.66  Jan. 23, 2008 90.89 1.43 0.72 Apr. 1, 2008 86.02 29.35  May 20,2008 1.12 0.11 Mar. 31, 2008 1.53 0.13 Apr. 11, 2008 2.70 0.71 Apr. 16,2008 1.22 0.40 May 20, 2008 IGF-1 54.39  21.102 Dec. 14, 2007 2.3 0.870.16 Jan. 23, 2008 100  0.02 0.49 0.13 May 20, 2008 0.97 0.48 Jul. 23,2008 IGF-1 AB IGF-1 A: B (E10Y16L17) 2.57 0.59 Mar. 31, 2008 49.2 7.425.59 Jul. 23, 2008 13 IGF-1 A: B (E10 Y16L17)- 7.00 2.82 Mar. 31, 200818.1 E31E32 B—COOH 8.52 4.34 Apr. 16, 2008 31.7 IGF-1 AB (D10Y16L17) DPI0.08 0.006 Dec. 14, 2007 1575 0.78 0.17 Jan. 23, 2008 111.538  9.75A—COOH 4.38 2.98 Apr. 16, 2008 ?? IGF-1 AB (E10Y16L17) DPI IGF-1 AB(H5D10Y16L17) DPI 12.22 5.46 Jan. 23, 2008 7.1 IGF-1 AB (H5D10Y16L17)(S═O) DPI IGF-1 A (H8 A9 N21) B 0.15 0.054 Dec. 14, 2007 840 0.43 0.44Jan. 23, 2008 181.395  2.81 (H5D10Y16L17) DPI A—COOH 0.25 0.2 Apr. 16,2008 1080 IGF-1 A (H8 A9 N21) 0.35 0.064 Dec. 14, 2007 360 11.26 2.55Jan. 23, 2008 7.7 32.54 B (H5D10Y16L17A22) DPI A—COOH 0.44 0.17 Apr. 16,2008 614 IGF-1 A (H8 A9 N21) 0.72 0.098 Dec. 14, 2007 B (H5D10Y16L17A22)(S) DPI A—COOH *All C-terminals are amides unless specified otherwise.

Example 12 Dipeptide Half Life on IGF1 Dipeptide Extended(p-NH₂—F)^(A19)Amide Analogs

The cleavage of an (pNH2-Phe) amide linked dipeptide AibPro from variousIGF-1 peptides was measured to determine the impact of the peptidesequence or heteroduplex on the dipeptide cleavage. Results for thetested peptides is shown in Table 12 and the data reveals that theIGF1-A chain alone represents a good model for the study of prodrug halflife for IGF1 B:A (Y^(B16)L^(B17)) peptides.

TABLE 12 Parent Peptide Half Life (hr)IGF1A(Ala)^(6, 11, 20)(pNH₂-Phe)^(A19) 2.2IGF1A(Acm)^(6, 11, 20)(pNH₂-Phe)^(A19) 1.8 IGF1B:A(S-S)^(A7, B7)(Acm)^(A6, 11, 20, B19)(pNH₂-Phe)^(A19) 1.8 IGF1B:A(pNH₂-Phe)^(A19) 1.6

Comparison of prodrug analogs of the IGF A-chain relative to thedisulfide bound A chain and B chain construct (IGF1 A:B(Y^(B16)L^(B17)))revealed the two compounds had similar half lives for the prodrug form.The AibAla analog does not cleave and thus is not a prodrug, but servesto show the modification can inactivate the insulin analog IGF1A:B(Y^(B16)L^(B17))(p-NH₂—F)^(A19)amide. Accordingly, the IGF1A chainalone was determined to be a good model for the study of pro-drug halflife on IGF1 B:A (Y^(B16)L^(B17)) analog peptides. The AibAla analogdoes not cleave and thus is not a prodrug, but serves to show themodification can inactivate the insulin analog IGF1A:B(Y^(B16)L^(B17))(p-NH₂—F)^(A19)amide. For simplicity, prodrug halflives were determined using only the IGF1 A chain in the absence of theB chain. The half lives of each propeptide was determined as describedin Example 5. The data is presented in Table 13:

TABLE 13 Dipeptide half life on IGF1 dipeptide extended (p-NH₂—F)^(A19)amide analogs Dipeptide Half Life (hr) Aib Pro 2.2 AibOH Pro 165.0 AibdPro 1.9 AibOH Sar 2.3 dK(acetyl) Sar 16.3 K Sar 21.8 K(acetyl) N-methylAla 23.6 dK(acetyl) N-methyl Ala 35.3

The data shows that by altering the substituents on the dipeptideprodrug element that the half life of prodrug can be varied from 2 hrsto >100 hrs.

Additional prodrug analog peptides were prepared using anIGF1-A(pNH2-F)¹⁹ base peptide and altering the amino acid composition ofthe dipeptide prodrug element linked through the 4-amino phenylalanineat position A19. Dipeptide half lives were measured for differentconstructs both in PBS and in 20% plasma/PBS (i.e. in the presence ofserum enzymes. The results are provided in Table 14. The resultsindicate that three of the four peptides tested were not impacted byserum enzymes.

TABLE 14 Dipeptide half life on IGF1-A(pNH2—F)¹⁹ Half Life (hr) 20% PBSPlasma/PBS Aib Pro 2.2 2.1 Aib dPro 2.1 2.2 AibOH Sar 2.3 dK N-isobutylGly 4.4 4.1 dK N-hexyl Gly 10.6 dK(acetyl) Sar 17.2 K Sar 21.8 5.9K(acetyl) N-methyl Ala 23.6 dK(acetyl) N-methyl Ala 35.3 AibOH Pro 165.0K(acetyl) Azetidine-2-carboxylic acid Not cleavable dK(acetyl)Azetidine-2-carboxylic acid Not cleavable

Example 13 Receptor Binding of IGF^(B16B17) Analog Peptides Over Time

Prodrug formulations of IGF^(B16B17) Analog Peptides were prepared andtheir degradation over time was measured using the insulin receptorbinding assay of Example 3. Peptides used in the assay were prepared asfollows:

Dipeptide-IGF1A Analogs

If not specified, Boc-chemistry was applied in the synthesis of designedpeptide analogs. Selected dipeptide H₂N-AA1-AA2-COOH was added to(pNH₂-Phe)¹⁹ on IGF1A (Ala)^(6,7,11,20). The IGF-1 A chain C-terminaltripeptide Boc(Fmoc-pNH-Phe)-Ala-Ala was synthesized on MBHA resin.After removal of Fmoc by the treatment with 20% piperidine/DMF at roomtemperature for 30 minutes, Fmoc-AA2 was coupled to the p-amino benzylside chain at A19 by using a threefold excess of amino acid, PyBop, DIEAand catalytic amount of pyridine. The Boc-synthesis of the remainingIGF-1 A chain (Ala)^(6,7,11,20) sequence was completed using an AppliedBiosystems 430A Peptide Synthesizer, yielding IGF-1 A chain(Boc)⁰(Ala)^(6,7,11,20)(Fmoc-AA2-pNH-Phe)¹⁹-MBHA. After the Fmoc groupwas removed from the N-terminus of AA2, Boc-AA1 was then coupled to theamine using threefold excess of amino acid, DEPBT and DIEA. Removal ofthe two Boc groups remaining on the A chain by TFA was followed by HFcleavage, yielding IGF-1 A-chain(Ala)^(6,7,11,20)(H₂N-AA1-AA2-pNH-Phe)¹⁹amide. In the case of AA1 beingd-lysine, acetylation on the 8-amine was performed prior to Boc removal.Dipeptide-IGF-1 A chain analogs were purified by semi-preparativeRP-HPLC and characterized by analytical RP-HPLC and MALDI massspectrometry.

Dipeptide-IGF-1 (YL) Analogs

A selected dipeptide H₂N-AA1-AA2-COOH was added to (pNH₂-Phe)¹⁹ on IGF-1A chain (Acm)⁶′¹¹′²⁰ as described immediately above except PAM resin wasused for the synthesis of IGF-1 A chain to yield a C terminal acid uponHF-cleavage. IGF-1 B chain (Y^(B16)L^(B17))(Acm)¹⁹ was synthesized onMBHA resin to yield a C terminal amide. The free thiol on Cys^(B7) wasmodified by Npys through reaction with DTNP at a 1:1 molar ratio in 100%DMSO. Purified dipeptide-IGF-1 A chain and IGF-1 B chain(Y^(B16)L^(B17)) analogs were assembled using the “1+2” two step chaincombination strategy illustrated in Scheme 1. Intermediate and finalpurifications were performed on semi-preparative RP-HPLC andcharacterized by analytical RP-HPLC and MALDI mass spectrometry.

The IGF^(B16B17) analog peptide prodrugs were incubated in PBS, pH 7.4at 37° C. and at predetermined time intervals an aliquot was taken andfurther degradation was quenched with 0.1% TFA and the aliquot wassubjected to analytical HPLC analysis. Peaks a and b, representing theprodrug and active forms of the IGF^(B16B17) analog peptide wereidentified with LC-MS and quantified by integration of peak area anHPLC. FIGS. 9A-9C show the output of an HPLC analysis of the degradationof the IGF^(B16B17) analog peptide prodrug:IGF1A(Ala)^(6,7,11,20)(Aib-Pro-pNH—F)¹⁹. Aliquots were taken at 20minutes (FIG. 9A), 81 minutes (FIG. 9B) and 120 minutes (FIG. 9C) afterbeginning the incubation of the prodrug in PBS. The data indicate thespontaneous, non-enzymatic conversion ofIGF1A(Ala)^(6,7,11,20)(Aib-Pro-pNH—F)¹⁹amide toIGF1A(Ala)^(6,7,11,20)(pNH₂—F)¹amide over time.

The degradation of the prodrug forms of IGF^(B16B17) analog peptides totheir active form was also measured based on the compounds ability tobind to the insulin receptor as measured using the in vitro assay ofExample 3. FIGS. 10A & 10B are graphs depicting the in vitro activity ofthe prodrug Aib,dPro-IGF1YL (dipeptide linked through the A194-aminoPhe). FIG. 10A is a graph comparing relative insulin receptorbinding of native insulin (measured at 1 hour at 4° C.) and the A19 IGFprodrug derivative (Aib,dPro-IGF1YL) over time (0 hours, 2.5 hours and10.6 hours) incubated in PBS. FIG. 10B is a graph comparing relativeinsulin receptor binding of native insulin and the A19 IGF prodrugderivative (Aib,dPro-IGF1YL) over time (0 hours, 1.5 hours and 24.8hours) incubated in 20% plasma/PBS at 37° C. As indicated by the datapresented in the graph, increased activity is recovered form the A19 IGFprodrug derivative sample as the prodrug form is converted to the activeIGF1YL peptide. The activity of the IGF^(B16B17) analog peptides wasmeasured relative to insulin receptor binding, and since the underlyingIGF^(B16B17) analog peptides have more activity than native insulin,activity of greater than 100% relative to insulin is possible.

FIGS. 11A & 11B are graphs depicting the in vitro activity of theprodrug dK,(N-isobutylG)-IGF1YL (dipeptide linked through the A194-aminoPhe). FIG. 11A is a graph comparing relative insulin receptorbinding of native insulin (measured at 1 hour at 4° C.) and the A19 IGFprodrug derivative (IGF1YL: dK,(N-isobutylG) over time (0 hours, 5 hoursand 52 hours) incubated in PBS. FIG. 11B is a graph comparing relativeinsulin receptor binding of native insulin and the A19 IGF prodrugderivative (IGF1YL: dK,(N-isobutylG) over time (0 hours, 3.6 hours and24.8 hours) incubated in 20% plasma/PBS at 37° C. As indicated by thedata presented in the graph, increased activity is recovered form theA19 IGF prodrug derivative sample as the prodrug form is converted tothe active IGF1YL peptide.

FIGS. 12A & 12B are graphs depicting the in vitro activity of theprodrug dK(e-acetyl),Sar)-IGF1YL (dipeptide linked through the A194-aminoPhe). FIG. 12A is a graph comparing relative insulin receptorbinding of native insulin (measured at 1 hour at 4° C.) and the A19 IGFprodrug derivative (IGF1YL: dK(e-acetyl),Sar) over time (0 hours, 7.2hours and 91.6 hours) incubated in PBS. FIG. 12B is a graph comparingrelative insulin receptor binding of native insulin and the A19 IGFprodrug derivative (IGF1YL: dK(e-acetyl),Sar) over time (0 hours, 9hours and 95 hours) incubated in 20% plasma/PBS at 37° C. As indicatedby the data presented in the graph, increased activity is recovered formthe A19 IGF prodrug derivative sample as the prodrug form is convertedto the active IGF1YL peptide.

Example 14 Biosynthesis and Purification of Single Chain Insulin Analogs

An insulin-IGF-I minigene comprising a native insulin B and A chainlinked via the IGF-I C chain)(B⁰-C¹-A⁰ was cloned into expression vectorpGAPZα A (purchased from Invitrogen) under GAP promoter (promoter of theglyceraldehyde-3-phosphate dehydrogenase (GAPDH)) for constitutiveexpression and purification of recombinant protein in yeast Pichiapastoris. The minigene was fused to an N-terminal peptide encodingSaccharomyces cerevisiae α-mating factor leader signal for secretion ofthe recombinant protein into the medium. A Kex2 cleavage site betweenthe minigene and the leading α-mating factor sequence was used to cleavethe leader sequence for secretion of the minigene with native aminotermini. Single-site alanine mutations were introduced into C peptide atpositions 1 (G1A), 2 (Y2A), 3 (G3A), 4 (S4A), 5 (S5A), 6 (S6A), 7 (R7A),8 (R8A), 10 (P10A), 11 (Q11A), and 12 (T12A) of the B⁰C¹A⁰ minigene.

The minigenes including B⁰C¹A⁰, eleven alanine mutants, and other selectderivatives were transformed into yeast Pichia pastoris byelectroporation. Positive transformants were selected on minimalmethanol plates and a genomic preparation of each Pichia isolate wasperformed and integration of the constructs into the yeast genome wasconfirmed by PCR. An 833 base pair PCR product was visualized on anagarose DNA gel. The insulin analogs were produced by fermentation of acorresponding yeast line. The yeast cells were pelleted bycentrifugation at 5 K for 20 minutes in 500 ml Beckman centrifuge tubesand the media was kept for subsequent protein purification.

Growth media supernatants were filtered through 0.2 μm Millipore filter.Acetonitrile (ACN) was added to the supernatant to a final volume of20%. The supernatant was purified over a Amberlite XAD7HP resin fromSigma, pre-equilibrated with 20% aqueous ACN. The resin was then rinsedtwice with 30 ml of 20% aqueous ACN and contaminants were removed with30% aqueous ACN containing 0.1% TFA. Partially purified insulin analogswere eluted from the column with 54% aqueous ACN containing 0.1% TFA andlyophilized. Lyophilized samples were re-suspended in 0.025M NH₃HCO₃ pH8 and purified on a Luna C18 column (10 μm particle size, 300A⁰ poresize). Protein was eluted from the column using a linear gradient of20-60% aqueous ACN. MALDI-MS positive fractions were pooled andtransferred to a disposable scintillation vial for subsequentlyophilization. Lyophilized samples were then resuspended in 20% aqueousACN containing 0.1% TFA, and purified on a Luna C18 column (10 μmparticle size, 300A⁰ pore size). The protein was eluted from the columnusing a linear gradient of 18-54% aqueous ACN with 0.1% TFA. Proteinelution was monitored at an absorbance 280 nm. MALDI-TOF MS positivefractions were analyzed via a C8 analytical column to insure purity.

FIG. 15 illustrates the potency of the single-chain insulin analogs. TheB⁰-C¹A⁰ analog demonstrated potency that was equally effective at bothinsulin receptor isoforms and the IGF-1 receptor. Mutation of thetyrosine at position 2 to alanine or the shortening of the C-peptide toeight amino acids through deletion of C9-12 provided a selectiveenhancement in the specificity of insulin action by significantreduction in the IGF-1 receptor activity. See also the data provided inTables 15A and 15B:

TABLE 15A Insulin Binding & Phosphorylation Analysis (B⁰C¹A⁰) InsulinBinding Insulin Phosphorylation Peptide IC₅₀, nM n EC₅₀, nM n Insulin0.54 ± 0.02 4 1.67 ± 0.13 1 IGF-1 18.81 ± 1.77  3 29.20 ± 8.41  1 010(B⁰C¹A⁰) 2.83 ± 0.52 2 1.93 ± 0.43 1 G1A 1.21 ± 0.15 1  2.4 ± 0.24 1 Y2A1.95 ± 0.28 3 1.86 ± 0.42 1 G3A 1.41 ± 0.05 2 2.13 ± 0.02 1 S4A 0.84 ±0.47 2 0.76 ± 0.35 1 S5A 0.93 ± 0.44 1 2.23 ± 1.27 1 S6A 1.15 ± 0.24 12.33 ± 1.65 2 R7A 6.04 ± 0.82 1 5.21 ± 4.14 1 R8A 0.63 ± 0.09 1 2.03 ±0.06 2 P10A 2.86 ± 0.93 1 2.59 ± 1.2  1 Q11A 1.79 ± 0.47 1 2.58 ± 0.83 1T12A  1.2 ± 0.18 1 2.83 ± 1.31 1

TABLE 15B IGF-1 Binding & Phosphorylation Analysis (B⁰C¹A⁰) IGF-1Binding IGF-1 Phosphorylation Peptide IC₅₀, nM n EC₅₀, nM n Insulin60.63 ± 4.43  1 48.66 ± 1.59  1 IGF-1 0.38 ± 0.07 1 0.88 ± 0.41 1 010(B⁰C¹A⁰) 4.49 ± 1.04 1 1.29 ± 2.28 1 G1A 42.36 ± 16.24 1  1.4 ± 0.62 1Y2A 257.9 ± 29.59 1  35.6 ± 14.55 1 G3A 34.02 ± 16.09 1 7.85 ± 0.78 1S4A 15.30 ± 3.10  1 1.64 ± 1.65 1 S5A 13.06 ± 3.01  1 2.63 ± 1.88 1 S6A2.44 ± 0.79 1 1.54 ± 0.62 2 R7 43.86 ± 8.72  1 1.26 ± 1.55 1 R8 10.85 ±1.47  1 0.50 ± 0.23 2 P10A 6.42 ± 0.47 1 2.79 ± 1.12 1 Q11A 4.23 ± 0.431 0.41 ± 0.69 1 T12A 9.15 ± 0.83 1 1.44 ± 1.36 1

FIG. 16 demonstrates that position 2 and 3 in the C-peptide are mostsensitive to modification at the IGF-1 receptor with the insulinreceptor proving to be relatively immune to modification. Finally, FIGS.17 and 18 present the in vitro analysis of the single-chain insulinmutants as a ratio of binding affinity (IC50) and biochemical signalingthrough tyrosine phosphorylation (EC50). The two independentmeasurements demonstrate great consistency thereby validating this invitro approach to structure-function analysis. All of the analogsmaintained single unit nanomolar activity with certain specific analogsproving to be slightly enhanced in potency (low single unit nanomolar).The most insulin selective analogs were those that we missing the lastfour residues of the C-peptide, had an alanine mutation at position twoof the C-peptide, or a combination of the two changes.

Example 15 Synthesis and Characterization of Single Chain InsulinAnalogs Linked by Mini-Peg

A series of single chain insulin analogs were prepared by solid-phasesynthesis using a two-step native chain ligation approach. The initialpeptide was a linear construct where the N-terminus started at CysB19and continued through to AsnA21 with a short linear polymer of ethyleneglycol serving as a connection from the C-terminus of the last B-chainamino acid to the N-terminus of the first amino acid of the A-chain,typically glycine. The N-terminal end of the B-chain (which typicallystarts with the first N-terminal amino acid of the final insulin analogand ends with amino acid 18 of the B-chain, typically valine) wasfragment-coupled to the single linear peptide. Once coupled bythiol-assisted native chain ligation, the peptide was purifiedchromatographically, converted to the correct disulfide isomer andpurified once more by high performance chromatography. All insulinanalogs were analyzed for purity by HPLC and MS analysis.

FIG. 19 provides a schematic overview of the synthetic design with asingle example of using PEG8 as a linker. The same approach was employedto synthesize analogs of shorter and longer length as well as those ofvariable length obtained by the use of more than one mini-peg covalentlylinked in linear fashion as an amide.

FIGS. 20-24 provide in vitro experimental results obtained through thestudy of the single chain insulin analogs linked by a mini-peg ofdefined length at a specific location. FIG. 20 illustrates that the useof the minipeg of 4, 8, or 16 ethylene glycol units yielded poor potencyinsulin analogs of less than 5% activity relative to the native hormoneas measured by binding or biochemical signaling. These results are lessimpressive than the potency we had previously observed for singleinsulin analogs where peptides varying in length of 8-12 amino acidswere used to linearly couple the C-terminus of the B-chain with theN-terminus of the A-chain (see FIGS. 14-18).

Table 16 and FIG. 21 demonstrate a dramatic increase in potency when thesame size mini-peg linkers were used to couple the C-terminus of ashortened B-chain to the N-terminus of the A-chain. The des-V (missingamino acids B26-30) insulin analog once coupled with the mini-pegs werecompetitively potent with native hormone, more than a tenfold increaserelative to the full length B-chain analogs.

TABLE 16 Phosphorylation Activity of mini-PEG linked Single ChainInsulin Analogs at Insulin and IGF-1 receptors Insulin IGF-1 % Insulin n% Insulin n PEG 4 5.69 3 0.44 2 PEG 8 7.44 5 1.21 4 PEG 16 5.17 3 0.16 2No PEG DesV 0.04 1 0 1 k-PEG 4 DesV 2.37 2 0.16 2 PEG 8 DesV 91.2 5 2.435 PEG 12 DesV 179 3 4.51 3 PEG 16 DesV 83.3 3 1.39 3

FIG. 22 demonstrates the competitive performance of two-chainheteroduplex native insulin with single chain insulin analogs where theA and B chains were either largely derived from insulin or IGF-1. Thepeptides were found to be of comparable bioactivity. FIG. 23 is afurther analysis using a peg-12 linker in the des-V format of theB-chain with largely IGF-1 based sequence, but including the B16,17 YLchange to enhance insulin receptor activity. The results clearlydemonstrate that very high potency insulin analogs can be obtainedthrough the use of histidine at position A8. Residual IGF-1 receptoractivity is slightly greater than native insulin. FIG. 24 shows thatthere is a linear correlation of insulin receptor activity to IGF-1receptor activity which is consistent with the type of relationshipknown for structure-function analysis performed with the two-chainnative insulin structure.

FIG. 25 demonstrates that the cleavage of insulin analogs by theinsulin-specific degrading enzyme (IDE) is extremely robust and easilydetected in those insulin analogs where the A-chain is derived from thenative insulin sequence. In contrast those analogs where the A-chain isderived from the sequence of IGF-1 appear to be extremely resistant toproteolysis. The prospect that the increased stability might engenderincreased mitogenicity was explored and the results are reported in FIG.26. There did not appear to be a correlation of the higher insulinpotency analogs with increased proliferation. Furthermore, and ofspecific importance to proteolytic stability the analogs that were moreresistant to IDE did not appear to be of any greater mitogenicpotential.

A comparative analysis was conducted on single chain analogs using PEGchain linkers to measure how different sized PEG linking moieties impactin vitro activities at the at the insulin and IGF-1 receptors asmeasured by receptor signaling through phosphorylation. The datapresented in FIG. 27 reveals that a PEG₁₂DesV construct (wherein the 5carboxy terminal amino acid of the B chain have been deleted) providesthe most potent compounds.

A single chain analog was constructed comprising a PEG₁₂ and a singleamino acid (glycine or lysine) as the linking moiety, linking a DesV Bchain to the native insulin A chain. Comparative analysis of singlechain peg/amino acid-linked analogs in vitro activities at the insulinand IGF-1 receptors as measured by receptor binding and receptorsignaling through phosphorylation revealed the peg/amino acid-linkedanalogs were potent insulin receptor agonists (see FIG. 28A). Similarlythe addition of two lysine residues to the linking moiety (single chainpeg/(lysine)₂-linked analog) produced a potent single chain peg/aminoacid-linked insulin receptor agonist (see FIG. 28B) as measured byreceptor binding and receptor signaling through phosphorylation.

FIG. 29A-E provides data of mice administered various single chaininsulin analogs. FIG. 29A provides in vitro comparative analysis ofsingle chain peg-linked analogs activities at the insulin receptor asmeasured by receptor binding and receptor signaling throughphosphorylation. The activity of the compounds in vivo was also testedby administering the compounds to mice. Food was removed four hoursprior to administering peptide to normal mice and withheld for theduration of the study. Glucose was measured just prior to administrationof test compounds and at 1, 2, 3, 6 and 8, 12, 16, 20 and 24 hours afteradministration. All insulin analogs were administered subcutaneously ina volume of 10 ul/gm of body weight. FIGS. 29B and 29C provide data onblood glucose concentrations over 8 hours after administration of thelisted analogs. FIGS. 29D and 29E provide data on blood glucose AUCvalues after administration of the listed analogs.

Analysis of blood glucose during an Insulin Tolerance Test (ITT) in micerevealed that a peg-12 linker in the des-V format of the B-chain witheither an insulin sequence or a largely IGF-1 based sequence, butincluding the B16,17 YL produced equivalent results as human insulin.See FIG. 27. Accordingly the single chain insulins linked with a PEGlinking moiety function in vivo.

Example 16 Acylated Insulin Analogs

Comparative insulin tolerance tests were conducted on mice comparing theability of human insulin relative to three different acylated insulinanalogs to reduce and sustain low blood glucose concentration. Thecompounds were tested at two different concentrations (27 nmol/kg and90nmol/kg). The acylated insulins included MIU-41 (a two chain insulinanalog having a C16 acylation via a gamma glutamic acid linker attachedto a lysine residue located at position A14), MIU-36 (a two chaininsulin analog having a C16 acylation linked to the N-terminus of the Bchain) and MIU-37 (a two chain insulin analog having a C16 acylation viaa gamma glutamic acid linker attached to a lysine residue located atposition B22). All three acylated insulin analogs provided a more basaland sustained lowered glucose levels relative to native insulin, evenafter 8 hours (See FIG. 30A-30D).

FIGS. 31A-31D show the results of comparative insulin tolerance testsconducted on mice comparing the ability of the commercially availableacylated insulin analog (Detemir) to the acylated two chain insulinagonist MIU-55. MIU-55 [B¹(H5,10,Y16,L17,C16rE-K22)25a: A¹(N18,N21)] hasthe C-terminal 5 amino acids of the B chain deleted and is acylated witha C14 fatty acid (myristoylic acid) through a gamma Glu linker at theε-amino group of Lys B29. The results indicate that MIU-55 is about onethird as potent as Detemir (See FIGS. 31A and 31B). The data indicatethat the acylated forms of insulin are longer acting than thenon-acylated forms and that MIU-55 while less potent than Detemir,exhibits a similar profile as Detemir. FIGS. 31C and 31D provide data onblood glucose AUC values after administration of the listed analogs. Acomparison of Detemir and MIU-49 in insulin tolerance tests revealedsimilar results (see FIGS. 32A-32D. MIU-49[B¹(C16-rE0,H5,Aib9,H10,E13-K17,Y16)25a: A¹(N18,N21)] is a two chaininsulin agonist having the C-terminal 5 amino acids of the B chaindeleted and acylated with a C16 fatty acid at the through a gamma Glulinker at the α-amino group of Gly B2. Again, the data shows that MIU-49is about one third as potent as Detemir (See FIGS. 32A and 32B thatMIU-49 while less potent than Detemir, exhibits a similar profile asDetemir.

Example 17 Pegylated Insulin Analogs

Various pegylated insulin analogs were prepared and tested in vitro.Table 17 shows the percent activity of each analog relative to nativeinsulin.

TABLE 17 Pegylated IGF-1 and Insulin Analogs % Insulin Activity MIU #Name IR-B IR-A IGF-1 R MIU-35 B¹(H5, H10, Y16, L17)25- 17.4 61.4 3.2C¹-A¹ (H8, N18, N21) MIU-56 C8-PEG20K B¹(H5, Y16, L17)25- 14.8PEG8-K-PEG4-A¹(N18, N21) MIU-57 B1-PEG20K MIU-35 1.1 3.1 1.2 MIU-58B2-PEG20K-B2- Dimer MIU-35 5.8 19.7 2.6 MIU-59 B1-PEG20K insulin 11.717.3 0.3 MIU-60 B29-PEG20K, B1, A1- insulin 2.7 2.4 <<0.3 NH₂CO MIU-61B1, B29, A1-tri-PEG5K insulin <0.1 0.2 <<<0.3 MIU-66 B1-PEG20K, A1-NH₂COinsulin 2.9 3.0 <0.3 MIU-67 B2, C8-PEG10K di- MIU-35 0.1 0.2 <0.1PEGylated MIU-68 B2, B22-PEG10K di- MIU-35 0.1 0.4 <0.1 PEGylated MIU-69B2, A14-PEG10K di- MIU-35 0.5 1.0 <0.1 PEGylated MIU-1 Insulin Standard100 100 1.77

Comparative insulin tolerance tests were conducted on mice comparing theability of the acylated insulin analog Detemir relative to the pegylatedsingle chain insulin analog MIU-56:B¹(H5,Y16,L17)25-PEG8-K-PEG4-A¹(N18,21). This single chain analogcomprises a 20 kDa PEG linked to the side chain of the single lysineresidue in the linking moiety (PEG8-K-PEG4) that joins the A chain andthe B chain. As shown in FIG. 33A-33D, the pegylated analog has asustained duration of action for 24 hours and its onset is gradualenough to avoid sedation of animals at the dosage required for sustainedaction through 24 hours. FIGS. 33C and 33D show the blood glucoseAUC_(24 hrs) in mice administered Detemir and MIU-56, respectively.Similar results were obtained for another pegylated single chain insulinanalog MIU-57 (see FIGS. 34A-34D). MIU-57 is an insulin single chainanalog (B¹(H5,Y16,L17)25-C¹-A¹(N18,21) comprising a 20 kDa PEG linked tothe N-terminal amine of the B chain. FIGS. 34A and 34B show the resultsof a comparative insulin dose titration of single chain insulin analogspegylated at the linking moiety (MIU-56) or pegylated at the N-terminalamine of the B chain (MIU-57), respectively. FIGS. 34C and 34D show theblood glucose AUC_(24 hrs) in mice administered MIU-56 and MIU-57,respectively. The data shows these analogs remain potent and have animproved therapeutic index relative to native insulin. Results fromcomparative insulin dose titrations of MIU-56 and MIU-57 reveal that asimilar profile is obtained in mice for dosages ranging from 20 nmol/kgthrough 80 nmol/kg (see FIGS. 34E and 34F)

A dimer (MIU 58) was prepared comprising two insulin single chainanalogs (B¹(H5,Y16,L17)25-C¹-A¹(N18,21) linked head to head via a 20 kDaPEG chain. FIGS. 34G-34J represents the results obtained from acomparative insulin tolerance test for MIU-57 and MIU-58 using C57/Blkmice. FIGS. 34G and 34H are graphs showing the results of insulintolerance tests comparing MIU-57 and MIU-58. FIGS. 34I and 34J show theblood glucose AUC_(24 hrs) in mice administered MIU-57 and MIU-58,respectively. The dimer is less potent than the parent compound, but isstill active.

FIGS. 35A and 35B provide data from a comparative insulin dose titrationof two pegylated native insulin heterodimers. The analogs comprise twonative insulin A and B chain sequences linked via the native disulfidelinkages, and modified to have either a 20 kDa PEG linked at theN-terminus of the B chain or at position B29 (with the amino terminus ofthe A and B chain carbamylated). The data shows that while thesecompounds differ slightly in their in vitro activities (See Table 17),they behave similarly in vivo in mice. Both compounds remain potent andhave an improved therapeutic index relative to non-pegylated nativeinsulin (slow onset, sustained activity for 24 hours and relativeflatness of the response).

Insulin analogs were also constructed having two or more covalentlylinked polyethylene glycol chains and compared to a native insulinanalog having a single 20 kDa PEG linked to its N-terminus. Moreparticularly, the activities of a single chain insulin analog(B¹(H5,Y16,L17)25-C¹(K8)-A¹(N18,21)) having two PEG chains (10K each)linked at the N-terminal alpha amine and at amino acid 8 of the linkingmoiety (position C8), a single chain insulin analog (B¹(H5,Y16,L17,K22)25-C¹(K8)-A¹(N18,21)) having two PEG chains (10K each) linked at theN-terminal alpha amine and at amino acid B22, and a single chain insulinanalog (B¹(H5,Y16,L17)25-C¹(K8)-A¹(K14, N18,21)) having two PEG chains(10K each) linked at the N-terminal alpha amine and at amino acid A14were compared. FIGS. 36A-36D provide data from a comparative insulindose titration of the three pegylated insulin analogs relative to thesingle pegylated native insulin derivative. The activity in vitro isdramatically reduced by at least 10×. However, while the in vivo datashows that some potency is lost, the double pegylated insulin analogsare still effective (particularly for the analog pegylated at thelinking moiety). More particularly, MIU-67 at 80 nmol/kg is roughlyequivalent to MIU-59 administered at 40 nmol/kg in reducing bloodglucose levels at least up to 8 hours after administration. Accordingly,insulin analogs can be prepared having two PEG chains of 10 kDa inlength that will provide an improved therapeutic index relative tonon-pegylated insulin analogs. In addition pegylation at the linkingmoiety of single chain analogs appears to be a preferred site ofpegylation.

Diabetic mice (db/db mice) were administered pegylated insulin analogsto compare their efficacy to commercially available insulin analogs. Inparticular, insulin analogs Levemir and Humulin were compared to thepegylated insulin analogs MIU-59 (native insulin analog having a single20 kDa PEG linked to its N-terminus) and MIU-66 (native insulin analoghaving a single 20 kDa PEG linked to its B chain N-terminus, and havingthe amino terminus of the A chain carbamylated). Although the in vitrodata (see Table 17) indicated that MIU-66 was far less potent thanMIU-59, MIU-59 and MIU-66 behave similarly in vivo, and both haveimproved activity relative to Levemir and Humulin (see FIGS. 37A and37B).

In summary, pegylation of insulin analogs, whether using an insulinbased or IGF based peptide backbone, in vivo, provides for a moreextended duration of action and a basal profile in the absence ofhypoglycemia.

Example 18 Comparative Insulin Tolerance for Insulin Prodrug Analogs

Normal mice were administered either an insulin heterodimer analog[B¹(Y16,L17,Y25)29a: A¹(aF19-NH2)], or a prodrug derivative thereof. Theprodrug derivative [B¹(Y16,L17,Y25)29a: A¹(aF19-dLys(Ac),NLeu)]comprises a 4-amino-phenylalanine substitution at position A19 wherein adipeptide dLys(Ac),NLeu have been covalently linked at the 4-aminoposition of the A19 residue. This dipeptide will autocleave underphysiological conditions with a half life of approximately 5 hours.After incubating the prodrug derivative [B¹(Y16,L17,Y25)29a:A¹(aF19-dLys(Ac),NLeu)] for 24 hours ex vivo, the resultant compound wasadministered to mice and it ability to lower blood glucose was comparedto parent compound. As shown in FIG. 38 the two compounds performedalmost identically.

Example 19 Pegylated Low Potency Alanine Analogs

The duration of action of the various insulin analogs disclosed hereincan be increased by decreasing their activity at the insulin receptor.Accordingly, in one embodiment the insulin analogs disclosed herein canbe modified to decrease their potency at the insulin receptor, includingmodification by 1 to 8, 1 to 5, 1 to 3, 1 to 2 or 1 amino acidsubstitution. In one embodiment the amino acid substitution is analanine substitution at a position selected from the group consisting ofB5, B10, B24, A1 or A8. Alanine substitutions at one or more of thesepositions substantially reduces potency, thus extending the duration ofaction at the insulin receptors. In one embodiment an insulin analog asdisclosed herein is further modified by a single alanine amino acidsubstitution at position B5, B24, A1 or A8. These compounds can befurther modified by pegylation as indicated in Table 18 (GE₅W=GEEEEEW, apeptide added to the N-terminus of the insulin analog to increasesolubility).

TABLE 18 IGF- Name Sequence IR-B IR-A 1 R MIU-35 B¹(H5, 10Y16L17)25-C¹-17.4% 61.4% 3.2% A¹(H8N18, 21) GE₅W- GE₅W-B¹(A5H10Y16L17)25-C¹- 2.3%8.6% 0.3% Ala, B5 A¹(H8N18N21) Ala, B5 B¹(A5H10Y16L17)25-C¹- 5.7% 2.5%A¹(H8N18N21) Ala, B24 B¹(H5, 10Y16L17A24)25-C¹- 0.4% 0.1% 0.3% A¹(H8N18,N21) GE₅W- GE₅W-B¹(H5, 10Y16L17)25-C¹- 0.7% 2.1% 0.5% Ala, A1A¹(A1H8N18, 21) Thr, A8 B¹(H5, 10Y16L17)25-C¹- 8.4% 20.4% 3.7% A¹(T8N18,21) PEGylated Analogs MIU-57 B1-PEG20K MIU-35 1.1% 4.5% 1.2% B1-PEG20K(GE₅W)- MIU-35 0.1% 0.3% Ala, A1

As shown in Table 19, single chain and two chain insulin analogs havebeen prepared and tested in vitro for activity at the insulin and IGF-1receptors and compared to their pegylated derivatives. Non-pegylatedforms have higher activity relative to the pegylated derivatives.Furthermore, dipegylating two chain insulin analogs using two 10 kDa PEGchains produces compounds of approximately similar activity relative tothe same analog comprising a single 20 kDa PEG chain (see the relativeactivities of B¹(H5,10Y16L17K29)29: A¹(H8,N18,21) relative to B1,A14-10KB¹(H5,10Y16L17R29)29: A¹(H8K14N18,21) and B¹(H5,10Y16L17K29)29:A¹(H8N18,21). For the single chain analogB¹(H5,10Y16L17K29)29-A¹(H8,N18,21) the addition of a 20 kDa produces acompound (B¹(H5,10Y16L17K29)29-A¹(H8N18,21) having almost 100 foldactivity at the insulin type-A receptor. Accordingly, by preparinginsulin analogs as two chain or single chain analogs and by selectingthe size, number and site of attachment of a PEG chain, the in vivopotency of the insulin analog can be modified, and presumably the invivo duration of action.

TABLE 19 PEGylation of Two-chain IGF-1 Analogs Analog Name Sequence IR-AIR-B IGF-1R Parent Peptide Backbones MIU-43 DP8Mut3 B¹(H5,10Y16L17R29)30-C¹des9-12-A¹(H8, N18, 21) 97.5% 16.7% 14.2% DP8Mut3KA14B¹(H5, 10Y16L17R29)30-C¹des9-12-A¹(H8, K14, N18, 21) 132.2% 12.6%DP3(SC) B¹(H5, 10Y16L17K29)29-A¹(H8, N18, 21) 0.03% DP3(TC) B¹(H5,10Y16L17K29)29: A¹(H8, N18, 21) 159.8% 33.1% PEGylated Analogs MIU-79di-10K-SC B1, A14-10K B¹(H5, 10Y16L17R29)30-C¹des9-12-A¹(H8, K14, N18,21) 1.7% 0.2% di-10K-TC B1, A14-10K B¹(H5, 10Y16L17R29)29: A¹(H8K14N18,21) 6.4% 2.1% MIU-77 mono-20K-SC B1-20K B¹(H5, 10Y16L17K29)29-A¹(H8N18,21) 0.1% MIU-78 mono-20K-TC B1-20K B¹(H5, 10Y16L17K29)29: A¹(H8N18, 21)8.2% 3.2%

The invention claimed is:
 1. A single chain insulin agonist analogcomprising the general structure B-LM-A wherein B represents an insulinB chain comprising the sequenceR₂₂-X₂₅LCGX₂₉X₃₀LVX₃₃X₃₄LX₃₆LVCGX₄₁X₄₂GFX₄₅ (SEQ ID NO: 20); Arepresents an insulin A chain comprising the sequenceGIVX₄X₅CCX₈X₉X₁₀CX₁₂LX₁₄X₁₅LX₁₇X₁₈X₁₉CX₂₁—R₁₃ (SEQ ID NO: 22); and LMrepresents a linking moiety linking the carboxy terminus of the B chainto the amino terminus of the A chain; further wherein the linking moietyis an 8 amino acid sequence consisting of the sequence X₅₁X₅₂GSSSX₅₇X₅₈(SEQ ID NO: 29), wherein X₅₁ is selected from the group consisting ofglycine, alanine, valine, leucine, isoleucine and proline; X₅₂ is anyamino acid other than tyrosine; X₅₇ and X₅₈ are independently selectedfrom the group consisting of arginine, lysine and ornithine; X₄ isglutamic acid or aspartic acid; X₅ is glutamine or glutamic acid; X₈ ishistidine, threonine or phenylalanine; X₉ is serine, arginine, lysine,ornithine or alanine; X₁₀ is isoleucine or serine; X₁₂ is serine oraspartic acid; X₁₄ is tyrosine, arginine, lysine, ornithine or alanine;X₁₅ is glutamine, glutamic acid, arginine, alanine, lysine, ornithine orleucine; X₁₇ is glutamine, glutamic acid, arginine, aspartic acid orlysine, ornithine; X₁₈ is methionine, asparagine, glutamine, asparticacid, glutamic acid or threonine; X₁₉ is tyrosine,4-methoxy-phenylalanine or 4-amino phenylalanine; X₂₁ is selected fromthe group consisting of alanine, glycine, serine, valine, threonine,isoleucine, leucine, glutamine, glutamic acid, asparagine, asparticacid, histidine, tryptophan, tyrosine, and methionine; X₂₅ is selectedfrom the group consisting of histidine and threonine; X₂₉ is selectedfrom the group consisting of alanine, glycine and serine; X₃₀ isselected from the group consisting of histidine, aspartic acid, glutamicacid, homocysteic acid and cysteic acid; X₃₃ is selected from the groupconsisting of aspartic acid and glutamic acid; X₃₄ is selected from thegroup consisting of alanine and threonine; X₃₆ is tyrosine; X₄₁ isselected from the group consisting of glutamic acid, aspartic acid orasparagine; X₄₂ is selected from the group consisting of alanine,ornithine, lysine and arginine; X₄₅ is selected from the groupconsisting of tyrosine, histidine, asparagine and phenylalanine; whereinR₁₃ is COOH or CONH₂; and R₂₂ is selected from the group consisting ofAYRPSE (SEQ ID NO: 14), FVNQ (SEQ ID NO: 12), PGPE (SEQ ID NO: 11), atripeptide glycine-proline-glutamic acid, a tripeptidevaline-asparagine-glutamine, a dipeptide proline-glutamic acid, adipeptide asparagine-glutamine, glutamine, glutamic acid and anN-terminal amine.
 2. The insulin analog of claim 1 wherein the linkingmoiety consists of the sequence X₅₁X₅₂GSSSRR (SEQ ID NO: 27).
 3. Theinsulin analog of claim 1 wherein X₅₂ is alanine, valine, leucine,isoleucine or proline.
 4. The insulin analog of claim 1 wherein the Achain comprises the sequence GIVEQCCTSICSLYQLENYCN (SEQ ID NO: 1), or asequence having 95% sequence identity with SEQ ID NO: 1, and the Bchains comprises the sequence FVNQHLCGSHLVEALYLVCGERGFFYTPKT (SEQ ID NO:2) or a sequence having 95% sequence identity with SEQ ID NO:
 2. 5. Asingle chain insulin agonist analog comprising the general structureB-LM-A wherein B represents an insulin B chain comprising the sequenceX₂₂VNQX₂₅LCGX₂₉X₃₀LVEALYLVCGERGFFYT-Z₁-B₁ (SEQ ID NO: 54); A representsan insulin A chain comprising the sequence GIVEQCCX₈SICSLYQLX₁₇NX₁₉CX₂₃(SEQ ID NO: 52); and LM represents a linking moiety linking the carboxyterminus of the B chain to the amino terminus of the A chain, whereinsaid linking moiety is an 8 amino acid sequence consisting of thesequence the sequence GAGSSSRR (SEQ ID NO: 32) or a sequence thatdiffers from GAGSSSRR (SEQ ID NO: 32) by 1 or 2 amino acidsubstitutions; further wherein X₈ is selected from the group consistingof threonine and histidine; X₁₇ is glutamic acid or glutamine; X₁₉ istyrosine, 4-methoxy-phenylalanine or 4-amino phenylalanine; X₂₂ isselected from the group consisting of phenylalanine anddesamino-phenylalanine; X₂₃ is asparagine or glycine; X₂₅ is selectedfrom the group consisting of histidine and threonine; X₂₉ is selectedfrom the group consisting of alanine, glycine and serine; X₃₀ isselected from the group consisting of histidine, aspartic acid, glutamicacid, homocysteic acid and cysteic acid; Z₁ is a dipeptide selected fromthe group consisting of aspartate-lysine, lysine-proline, andproline-lysine; and B₁ is selected from the group consisting ofthreonine, alanine or a threonine-arginine-arginine tripeptide.
 6. Theinsulin analog of claim 5 wherein the linking moiety consists of thesequence GAGSSSRR (SEQ ID NO: 32) or GYGSSSRR (SEQ ID NO: 18).
 7. Theinsulin analog of claim 5 wherein the linking moiety consists of thesequence GAGSSSRR (SEQ ID NO: 32).
 8. The insulin analog of claim 5,wherein the A chain comprises the sequence GIVEQCCTSICSLYQLENYCN (SEQ IDNO: 1), and the B chain comprises the sequenceFVNQHLCGSHLVEALYLVCGERGFFYTPKT (SEQ ID NO: 2).
 9. The single chaininsulin agonist analog of claim 1 wherein A represents an insulin Achain comprising the sequence GIVDECCX₈RSCDLYQLENX₁₉CN—R₁₃ (SEQ ID NO:195) or GIVEQCCTSICSLYQLENYCN (SEQ ID NO: 1); B represents an insulin Bchain comprising the sequence GPEX₂₅LCGSHLVDALYLVCGDX₄₂GFX₄₅ (SEQ ID NO:196) or FVNQHLCGSHLVEALYLVCGERGFFYTPKT (SEQ ID NO: 2), wherein X₈ isthreonine, histidine or phenylalanine; X₁₉ is tyrosine,4-methoxy-phenylalanine or 4-amino phenylalanine; X₂₅ is histidine orthreonine; X₄₂ is alanine, ornithine or arginine; X₄₅ is tyrosinehistidine, asparagine or phenylalanine; X₅₈ is arginine, lysine,cysteine, homocysteine, acetyl-phenylalanine or ornithine; and R₁₃ isCOOH or CONH₂; and LM represents a linking moiety consisting of thesequence GAGSSSRR (SEQ ID NO: 32) or an amino acid that differs from SEQID NO: 32 by a single amino acid substitution at a position other thanposition C2.
 10. A single chain insulin agonist analog comprising thegeneral structure B-LM-A wherein B represents an insulin B chaincomprising the sequence X₂₂VNQX₂₅LCGX₂₉X₃₀LVEALYLVCGERGFFYT-Z₁-B₁ (SEQID NO: 54); A represents an insulin A chain comprising the sequenceGIVEQCCX₈SICSLYQLX₁₇NX₁₉CX₂₃ (SEQ ID NO: 52); and LM represents alinking moiety linking the carboxy terminus of the B chain to the aminoterminus of the A chain, wherein said linking moiety is an 8 amino acidsequence consisting of the sequence X₅₁X₅₂X₅₃X₅₄X₅₅SRR (SEQ ID NO: 26)wherein X₅₁ is selected from the group consisting of glycine, alanine,valine, leucine, isoleucine and proline; X₅₂ is any amino acid otherthan tyrosine; X₅₃, X₅₄, and X₅₅ are independently selected from thegroup consisting of glycine, alanine, serine, threonine and proline;further wherein X₈ is selected from the group consisting of threonineand histidine; X₁₇ is glutamic acid or glutamine; X₁₉ is tyrosine,4-methoxy-phenylalanine or 4-amino phenylalanine; X₂₂ is selected fromthe group consisting of phenylalanine and desamino-phenylalanine; X₂₃ isasparagine or glycine; X₂₅ is selected from the group consisting ofhistidine and threonine; X₂₉ is selected from the group consisting ofalanine, glycine and serine; X₃₀ is selected from the group consistingof histidine, aspartic acid, glutamic acid, homocysteic acid and cysteicacid; Z₁ is a dipeptide selected from the group consisting ofaspartate-lysine, lysine-proline, and proline-lysine; and B₁ is selectedfrom the group consisting of threonine, alanine or athreonine-arginine-arginine tripeptide.
 11. The insulin analog of claim10, wherein the linking moiety is an 8 amino acid sequence consisting ofthe sequence X₅₁X₅₂GSSSRR (SEQ ID NO: 27) or a sequence that differsfrom SEQ ID NO: 27 by a single amino acid substitution, wherein X₅₁ isselected from the group consisting of glycine, alanine, valine, leucine,isoleucine and proline; and X₅₂ is alanine, valine, leucine, isoleucineor proline.
 12. The insulin analog of claim 11, wherein the A chaincomprises the sequence GIVEQCCTSICSLYQLENYCN (SEQ ID NO: 1), and the Bchain comprises the sequence FVNQHLCGSHLVEALYLVCGERGFFYTPKT (SEQ ID NO:2).
 13. The insulin analog of claim 12, wherein the linking moiety is an8 amino acid sequence consisting of the sequence X₅₁X₅₂GSSSRR (SEQ IDNO: 27), wherein X₅₁ is glycine; and X₅₂ is proline.